Abstract

We study quenched distributions on random walks in a random potential on integer
lattices of arbitrary dimension and with an arbitrary finite set of admissible
steps. The potential can be unbounded and can depend on a few steps of the 
walk. Directed, undirected and stretched polymers, as well as random walk in 
random environment, are covered. The restriction needed is on the moment of 
the potential, in relation to the degree of mixing of the ergodic environment. We 
derive two variational formulas for the limiting quenched free energy and prove 
a process-level quenched large deviation principle for the empirical measure. As 
a corollary we obtain LDPs for types of random walk in random environment 
not covered by earlier results. 



1 Introduction 

This paper investigates the limiting free energy and large deviations for several 
much-studied lattice models of random motion in a random medium. These include 
walks in random potentials, also called polymer models, and the standard random 
walk in random environment (RWRE). We derive variational formulas for the free 
energy and process-level large deviations for the empirical measure. 



1.1 Walks in random potentials and environments 

We call our basic model random walk in a random potential (RWRP). A special 
case is random walk in random environment (RWRE). Fix a dimension $d \in \mathbb{N}$.
There are three ingredients to the model: (i) a reference random walk on $\mathbb{Z}^d$, (ii)
an environment, and (iii) a potential.

(i) Fix a finite subset $\mathcal{R} \subset \mathbb{Z}^d$. Let $P_x$ denote the distribution of the discrete time random walk on $\mathbb{Z}^d$ that starts at $x$ and has jump probability $p(z) = 1/|\mathcal{R}|$ for $z \in \mathcal{R}$ and $p(z) = 0$ otherwise. $E_x$ is expectation under $P_x$. The walk is denoted by $X_{0,\infty} = (X_n)_{n \ge 0}$. Let $\mathcal{G}$ be the additive subgroup of $\mathbb{Z}^d$ generated by $\mathcal{R}$.

(ii) An environment $\omega$ is a sample point from a probability space $(\Omega, \mathfrak{S}, \mathbb{P})$. $\Omega$ comes equipped with a group $\{T_z : z \in \mathcal{G}\}$ of measurable commuting bijections that satisfy $T_{x+y} = T_x T_y$ and $T_0$ is the identity. $\mathbb{P}$ is a $\{T_z : z \in \mathcal{G}\}$-invariant probability measure on $(\Omega, \mathfrak{S})$ that is ergodic under this group. In other words, if $A \in \mathfrak{S}$ satisfies $T_z A = A$ for all $z \in \mathcal{G}$ then $\mathbb{P}(A) = 0$ or $1$. $\mathbb{E}$ will denote expectation relative to $\mathbb{P}$. We call $(\Omega, \mathfrak{S}, \mathbb{P}, \{T_z : z \in \mathcal{G}\})$ a measurable ergodic dynamical system.

(iii) A potential is a measurable function $V : \Omega \times \mathcal{R}^\ell \to \mathbb{R}$, for some integer $\ell \ge 0$.

Given an environment $\omega$ and a starting point $x \in \mathbb{Z}^d$, for $n \ge 1$ define the quenched polymer measures

\[ Q^{V,\omega}_{n,x}\{X_{0,\infty} \in A\} = \frac{1}{Z^{V,\omega}_{n,x}}\, E_x\Big[ e^{-\sum_{k=0}^{n-1} V(T_{X_k}\omega,\, Z_{k+1,k+\ell})}\, 1_A(X_{0,\infty}) \Big] \tag{1.1} \]

normalized by the quenched partition function

\[ Z^{V,\omega}_{n,x} = E_x\Big[ e^{-\sum_{k=0}^{n-1} V(T_{X_k}\omega,\, Z_{k+1,k+\ell})} \Big]. \]

$Z_k = X_k - X_{k-1}$ is a step of the walk and vectors are $X_{i,j} = (X_i, X_{i+1}, \dots, X_j)$. $Q^{V,\omega}_{n,x}$ represents the evolution of the polymer in a "frozen" environment $\omega$. (The picture is that of a heated sword quenched in water.) Let us mention two models of special importance.
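For very small $n$ the partition function can be evaluated directly from its definition by averaging over all step sequences. The following is a minimal sketch and not part of the original development: the function name `partition_function`, the opaque `env` object, and the calling convention `V(env, pos, future_steps)` (standing for $V(T_{X_k}\omega, Z_{k+1,k+\ell})$ with `pos` playing the role of $X_k$) are assumptions made only for this illustration.

```python
import itertools
import math

def partition_function(V, env, steps, n, ell, x):
    """Brute-force E_x[exp(-sum_{k=0}^{n-1} V(T_{X_k} omega, Z_{k+1,k+ell}))].

    V(env, pos, future_steps) is a hypothetical callable: `pos` is X_k and
    `future_steps` is the tuple (Z_{k+1}, ..., Z_{k+ell}).
    """
    m = n + ell - 1  # the sum only depends on the steps Z_1, ..., Z_{n+ell-1}
    total = 0.0
    for zs in itertools.product(steps, repeat=m):
        pos, energy = x, 0.0
        for k in range(n):
            energy += V(env, pos, zs[k:k + ell])
            if k < n - 1:  # advance from X_k to X_{k+1} using Z_{k+1}
                pos = tuple(a + b for a, b in zip(pos, zs[k]))
        total += math.exp(-energy)
    # uniform reference walk: every step sequence has weight |R|^{-m}
    return total / len(steps) ** m
```

The cost grows like $|\mathcal{R}|^{n+\ell-1}$, so this is only useful as a sanity check against faster recursions.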

Example 1.1 ($k+1$ dimensional directed polymer in a random environment). Take the canonical setting: product space $\Omega = \Gamma^{\mathbb{Z}^d}$ with generic points $\omega = (\omega_x)_{x \in \mathbb{Z}^d}$ and translations $(T_x\omega)_y = \omega_{x+y}$. Then let $d = k+1$, $V(\omega) = -\beta\omega_0$ with inverse temperature parameter $\beta$, $\mathcal{R} = \{e_i : 1 \le i \le k\}$, and the coordinates $\{\omega_x\}$ i.i.d. under $\mathbb{P}$. Thus the projection of the walk on $\mathbb{Z}^k$ is simple random walk, and at every step the walk sees a fresh environment.
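For a directed polymer the partition function can be computed in polynomial time by a forward recursion over the time layers. The sketch below is illustrative only. It assumes the $1+1$ dimensional case with steps $(\pm 1, +1)$ (one concrete choice whose spatial projection is simple random walk), standard normal weights, and hypothetical helper names `make_iid_environment` and `directed_polymer_log_Z`; none of these choices is taken from the text.

```python
import math
import random

def make_iid_environment(seed):
    """Return omega(x, t): i.i.d. standard normal weights, sampled lazily and cached."""
    rng = random.Random(seed)
    cache = {}

    def omega(x, t):
        if (x, t) not in cache:
            cache[(x, t)] = rng.gauss(0.0, 1.0)
        return cache[(x, t)]

    return omega

def directed_polymer_log_Z(omega, n, beta):
    """log E_0[exp(beta * sum_{k=0}^{n-1} omega(X_k, k))] for steps (+1,1) and (-1,1)."""
    weights = {0: 1.0}  # normalized weight of each spatial position at the current time
    log_Z = 0.0
    for k in range(n):
        # multiply in the Boltzmann factor e^{-V} = e^{beta * omega} at the current site
        weights = {x: w * math.exp(beta * omega(x, k)) for x, w in weights.items()}
        total = sum(weights.values())
        log_Z += math.log(total)  # accumulate the growth, keep the weights normalized
        nxt = {}
        for x, w in weights.items():
            for dx in (-1, 1):  # each step has probability 1/2
                nxt[x + dx] = nxt.get(x + dx, 0.0) + 0.5 * w / total
        weights = nxt
    return log_Z
```

Dividing the returned value by $n$ gives a finite-$n$ approximation of the limit in (1.2) with $V = -\beta\omega_0$; renormalizing at every layer avoids floating-point overflow for large $n$.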

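Example 1.2 (Random walk in random environment). RWRE is a Markov chain $X_n$ on $\mathbb{Z}^d$ whose transition probabilities are determined by an environment $\omega \in \Omega$. Let $\mathcal{P} = \{(p_z)_{z \in \mathcal{R}} \in [0,1]^{\mathcal{R}} : \sum_z p_z = 1\}$ be the set of probability distributions on $\mathcal{R}$ and $p : \Omega \to \mathcal{P}$ a measurable function with $p(\omega) = (p_z(\omega))_{z \in \mathcal{R}}$. A transition probability matrix is defined by

\[ \pi_{x,y}(\omega) = \begin{cases} p_{y-x}(T_x\omega) & y - x \in \mathcal{R} \\ 0 & y - x \notin \mathcal{R} \end{cases} \qquad \text{for } x, y \in \mathbb{Z}^d. \]

Given $\omega$ and $x \in \mathbb{Z}^d$, $P_x^\omega$ is the law of the Markov chain $X_{0,\infty} = (X_n)_{n \ge 0}$ on $\mathbb{Z}^d$ with initial point $X_0 = x$ and transition probabilities $\pi_{y,z}(\omega)$. That is, $P_x^\omega$ satisfies $P_x^\omega\{X_0 = x\} = 1$ and

\[ P_x^\omega\{X_{n+1} = z \mid X_n = y\} = \pi_{y,z}(\omega) \qquad \text{for all } y, z \in \mathbb{Z}^d. \]

$P_x^\omega$ is called the quenched distribution of the walk $X_n$. The averaged (or annealed) distribution is the path marginal $P_x(\cdot) = \int P_x^\omega(\cdot)\, \mathbb{P}(d\omega)$ of the joint distribution $P_x(dx_{0,\infty}, d\omega) = P_x^\omega(dx_{0,\infty})\, \mathbb{P}(d\omega)$.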

RWRE is a special case of (1.1) with $V(\omega, z_{1,\ell}) = -\log \pi_{0,z_1}(\omega)$. (Note the abuse of notation: for RWRE $P_0$ is the averaged measure while in RWRP $P_0$ is the reference random walk. This should cause no confusion.)

Of particular interest are RWREs where $0$ lies outside the convex hull of $\mathcal{R}$. These are strictly directed in the sense that for some $u \in \mathbb{R}^d$, $z \cdot u > 0$ for each admissible step $z \in \mathcal{R}$. General large deviation theory for these walks is covered for the first time in the present paper.

1.2 Results 

We have two types of results. First we prove the $\mathbb{P}$-a.s. existence of the quenched free energy

\[ \lim_{n\to\infty} n^{-1} \log Z^{V,\omega}_{n,0} = \lim_{n\to\infty} n^{-1} \log E_0\Big[ e^{-\sum_{k=0}^{n-1} V(T_{X_k}\omega,\, Z_{k+1,k+\ell})} \Big] \tag{1.2} \]

and derive two variational formulas for the limit. The assumption we need combines moment bounds on $V$ with the degree of mixing in $\mathbb{P}$: if $\mathbb{P}$ is merely ergodic we require a bounded $V$, while with independence or exponential mixing $L^p$ for $p > d$ is sufficient. The existence of the limit is not entirely new because in some cases it follows from subadditive methods and concentration inequalities. In Example 1.1 [8] proved the limit under an exponential moment assumption and [47] with the tail assumption under which greedy lattice animals are known to have linear growth. Our variational descriptions of the free energy are new.

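The second results are large deviation principles (LDPs) for the quenched distributions $Q^{V,\omega}_{n,0}\{R_n^\infty \in \cdot\,\}$ of the empirical process

\[ R_n^\infty = n^{-1} \sum_{k=0}^{n-1} \delta_{T_{X_k}\omega,\, Z_{k+1,\infty}}. \]

$T_{X_k}\omega$ is the environment seen from the current position of the walk and $Z_{k+1,\infty} = (Z_i)_{k+1 \le i < \infty}$ is the entire sequence of future steps. We assume $\Omega$ separable metric with Borel $\sigma$-algebra $\mathfrak{S}$. Distributions $Q^{V,\omega}_{n,0}\{R_n^\infty \in \cdot\,\}$ are probability measures on $\mathcal{M}_1(\Omega \times \mathcal{R}^{\mathbb{N}})$, the space of Borel probability measures on $\Omega \times \mathcal{R}^{\mathbb{N}}$ endowed with the weak topology generated by bounded continuous functions.

The LDP takes this standard form. There is a lower semicontinuous convex rate function $I_{q,3} : \mathcal{M}_1(\Omega \times \mathcal{R}^{\mathbb{N}}) \to [0,\infty]$ such that these bounds hold:

\[ \varlimsup_{n\to\infty} n^{-1} \log Q^{V,\omega}_{n,0}\{R_n^\infty \in C\} \le -\inf_{\mu \in C} I_{q,3}(\mu) \quad \text{for all compact sets } C \]
\[ \text{and} \quad \varliminf_{n\to\infty} n^{-1} \log Q^{V,\omega}_{n,0}\{R_n^\infty \in O\} \ge -\inf_{\mu \in O} I_{q,3}(\mu) \quad \text{for all open sets } O. \]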

Large deviations of $R_n^\infty$ are called level 3 or process level large deviations. For basic large deviation theory we refer the reader to [11], [12], [14], [31], and [45].
Since we prove the upper bound only for compact sets the result is technically known as a weak LDP. In the important special case of strictly directed walk in an i.i.d. environment we strengthen the result to a full LDP where the upper bound is valid for all closed sets. Often $\Omega$ is compact and then this issue vanishes. As a corollary we obtain large deviations for RWRE.

This paper does not investigate models that allow $V = \infty$. An example in RWRE would be a walk on a supercritical percolation cluster.

1.3 Overview of literature and predecessors of this work 

Random walk in random environment was introduced by Chernov [6] in 1967 
and Temkin [44] in 1972 as a model for DNA replication. Random walk in ran- 
dom potential appeared in the work of Huse and Henley [24] in 1985 on impurity- 
induced domain-wall roughening in the two-dimensional Ising model. The seminal 
mathematical work on RWRE was Solomon 1975 [39] and on RWRP Imbrie and 
Spencer 1988 [25] and Bolthausen 1989 [3]. Despite a few decades of effort many 
basic questions on (i) recurrence, transience and zero-one laws, (ii) fluctuation be- 
havior and (iii) large deviations remain only partially answered. Accounts of parts 
of the state of the art can be found in the lectures [4], [23], [43] and [50] on RWRE, 
and in [9], [13], [21], [36] and [42] on RWRP. 

Our LDP Theorem 3.1 specialized to RWRE covers the quenched level 1 LDPs for RWRE that have been established over the last two decades. In the one-dimensional case Greven and den Hollander [22] considered the i.i.d. nearest-neighbor case, Comets, Gantert, and Zeitouni [7] the ergodic nearest-neighbor case, and Yilmaz [49] the ergodic case with $\mathcal{R} = \{z : |z| \le M\}$ for some $M$. In the multidimensional setting Zerner [52] looked at the i.i.d. nearest-neighbor nestling case, and Varadhan [46] the general ergodic case with bounded step size and $\{z : |z| = 1\} \subset \mathcal{R}$. All these works, with the exception of [52], required uniform ellipticity at least on part of $\mathcal{R}$, i.e. $\pi_{0,z} \ge \kappa$ for a fixed $\kappa > 0$ and all $z$ with $|z| = 1$. [52] needs $\mathbb{E}[|\log \pi_{0,z}|^d] < \infty$, still for all $|z| = 1$. Rosenbluth [35] gave a variational formula for the rate function in [46] under an assumption of $p > d$ moments on $\log \pi_{0,z}$, $|z| = 1$.

Article [32] proved a quenched level 3 LDP for RWRE under a general ergodic environment, subject to bounded steps, $p > d$ moments on $\log \pi_{0,z}$, and an irreducibility assumption that required the origin to be accessible from every $x \in \mathbb{Z}^d$. (See Remark 2.7 for more technical explanation of the scope of [32] compared to the present paper.) Level 3 large deviations for RWRE have not appeared in other works. [49] gave a quenched univariate level 2 LDP. This means that the path component in the empirical measure has only one step: $n^{-1} \sum_{k=0}^{n-1} \delta_{T_{X_k}\omega,\, Z_{k+1}}$.

One goal of the present paper is to eliminate the unsatisfactory irreducibility assumption of [32]. This is important because the irreducibility assumption excluded several basic and fruitful models, such as directed polymers, RWRE in a space-time, or dynamical, environment (the case $\mathcal{R} \subset \{x : x \cdot e_1 = 1\}$) and RWRE with a forbidden direction (the case $\mathcal{R} \subset \{x : x \cdot u \ge 0\}$ for some $u \ne 0$). Corollary A.3 shows that forbidden direction is the only case not covered by [32], but [32] did not address the more general polymer model.

Our results cover the quenched level 1 LDPs for space-time RWRE derived in 
[48] for i.i.d. environment in a neighborhood of the asymptotic velocity and by 
Avena, den Hollander, and Redig [2] for a space-time random environment given 
by a mixing attractive spin flip particle system. Our results can also be adapted 
to continuous time to cover the quenched level 1 LDP by Drewitz et al. [15] for a 
random walk among a Poisson system of moving traps. 

On the RWRP side, Theorems 2.3 and 3.1 cover, respectively, the existence of
free energy and the quenched level 1 LDPs for simple random walk in random 
potential proved by Zerner [51] and the corresponding results for directed simple 
random walk in random potential proved by Carmona and Hu [5] and Comets, 
Shiga, and Yoshida [8]. (See [41] for an earlier continuous counterpart of [51].) 
We also give an entropy interpretation for the rate function and two variational 
formulas for the free energy, while earlier descriptions of these objects came in 
terms of Lyapunov functions and subadditivity arguments. As far as we know, 
level 2 or 3 large deviations have not been established in the past for RWRP. 

The technical heart of [32] was a multidimensional extension of a homoge- 
nization argument that goes back to Kosygina, Rezakhanlou, and Varadhan [27] in 
the context of diffusion in time-independent random potential. This argument was 
used by Rosenbluth [35] and Yilmaz [49] to prove LDPs for RWRE.

The main technical contribution of the current work is a new approach to the 
homogenization argument that allows us to drop the aforementioned irreducibility 
requirement. One comment to make is that this construction that we undertake in 
Appendix C does use the invertibility of the transformations T z assumed in Section 
1.1. This is the only place where that is needed. 

The homogenization method of [27] was sharpened by Kosygina and Varadhan
[28] to handle time-dependent but bounded random potentials. The results in [27]
and [28] concerned homogenization of stochastic Hamilton-Jacobi-Bellman equations
and yielded variational formulas for the effective Hamiltonian. For a special
case of the random Hamiltonian one can convert these results into quenched large 
deviations for the velocity of a diffusion in a random potential, with variational 
formulas for the quenched free energy. Using different methods, [29] and [1] ob- 
tain homogenization results similar to [27] and [28], respectively. Furthermore, [1] 
allows unbounded potentials and requires mixing to compensate for the unboundedness;
compare with part (d) of our Lemma A.4. It is noteworthy that when $d = 1$,
an ergodic $L^1$ potential is in fact enough; see [18] and compare with part (b) of our
Lemma A.4.

We end this section with some conventions for easy reference. For a measurable space $(\mathcal{X}, \mathcal{B})$, $\mathcal{M}_1(\mathcal{X})$ is the space of probability measures on $\mathcal{X}$ and $\mathcal{Q}(\mathcal{X})$ the set of Markov transition kernels on $\mathcal{X}$. Given $\mu \in \mathcal{M}_1(\mathcal{X})$ and $q \in \mathcal{Q}(\mathcal{X})$, $\mu \times q$ is the probability measure on $\mathcal{X} \times \mathcal{X}$ defined by $\mu \times q(A \times B) = \int 1_A(x)\, q(x, B)\, \mu(dx)$ and $\mu q$ is its second marginal. $E^\mu[f]$ denotes expectation of $f$ under probability measure $\mu$. The increments of a path $(x_i)$ in $\mathbb{Z}^d$ are denoted by $z_i = x_i - x_{i-1}$. Segments of sequences are denoted by $z_{i,j} = (z_i, z_{i+1}, \dots, z_j)$, also for $j = \infty$.
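On a finite state space the conventions $\mu \times q$ and $\mu q$ reduce to elementary sums. The following sketch is only an illustration under that finiteness assumption; the dictionary representation and the name `joint_and_second_marginal` are hypothetical and not taken from the text.

```python
def joint_and_second_marginal(mu, q):
    """Finite-state version of mu x q and mu q.

    mu: dict mapping state -> probability.
    q:  dict mapping state -> dict of (next state -> transition probability).
    Returns (joint, second) where joint[(x, y)] = mu(x) q(x, y) and
    second[y] = sum_x mu(x) q(x, y).
    """
    joint = {}
    second = {}
    for x, px in mu.items():
        for y, qxy in q[x].items():
            joint[(x, y)] = joint.get((x, y), 0.0) + px * qxy
            second[y] = second.get(y, 0.0) + px * qxy
    return joint, second
```

A measure $\mu$ is invariant for the kernel $q$ exactly when the second marginal returned here coincides with `mu`.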

2 Variational representations for free energy 

Standing assumptions in this section are that $(\Omega, \mathfrak{S}, \mathbb{P}, \{T_z : z \in \mathcal{G}\})$ is a measurable ergodic dynamical system and, as throughout the paper, $\mathcal{R}$ is an arbitrary finite subset of $\mathbb{Z}^d$ that generates the additive group $\mathcal{G}$. These will not be repeated in the statements of lemmas and theorems. Most of the time we also assume that $\mathfrak{S}$ is countably generated; this will be mentioned. The relevant Markov process for this analysis is $(T_{X_n}\omega, Z_{n+1,n+\ell})$ with state space $\Omega_\ell = \Omega \times \mathcal{R}^\ell$. The evolution goes via the transformations $S_z^+(\omega, z_{1,\ell}) = (T_{z_1}\omega, (z_{2,\ell}, z))$ on $\Omega_\ell$ where the step $z$ is chosen randomly from $\mathcal{R}$ as stipulated by the kernel $p$. Elements of $\Omega_\ell$ are abbreviated $\eta = (\omega, z_{1,\ell})$.

We first look at the limiting logarithmic moment generating function (1.2), also called the pressure or the free energy. To cover much-studied directed polymer models it is important to go beyond bounded continuous potentials. To achieve this, and at the same time provide a succinct statement of a key hypothesis for Lemma 2.8 below, we introduce class $\mathcal{L}$ in the next definition. Let

\[ D_n = \{z_1 + \dots + z_n \in \mathbb{Z}^d : z_{1,n} \in \mathcal{R}^n\} \tag{2.1} \]

denote the set of points accessible from the origin in exactly $n$ steps from $\mathcal{R}$.
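The sets $D_n$ can be generated layer by layer. The short sketch below is an illustration only; the function name `accessible_points` and the representation of steps as integer tuples are assumptions of this example.

```python
def accessible_points(steps, n):
    """Return D_n: all sums z_1 + ... + z_n with each z_i taken from `steps`."""
    points = {tuple(0 for _ in steps[0])}  # D_0 contains only the origin
    for _ in range(n):
        points = {
            tuple(a + b for a, b in zip(p, z))
            for p in points
            for z in steps
        }
    return points
```

For instance, with the two steps $(1,0)$ and $(0,1)$ the set $D_n$ is the antidiagonal $\{(i, n-i) : 0 \le i \le n\}$, while for the steps $\pm 1$ on $\mathbb{Z}$ it consists of the points of $\{-n, \dots, n\}$ with the same parity as $n$.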

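Definition 2.1. A function $g : \Omega \to \mathbb{R}$ is in class $\mathcal{L}$ if $g \in L^1(\mathbb{P})$ and for any nonzero $z \in \mathcal{R}$

\[ \lim_{\varepsilon \searrow 0}\, \varlimsup_{n\to\infty}\, \max_{x \in \bigcup_{k=0}^{n} D_k}\, \frac{1}{n} \sum_{0 \le i \le \varepsilon n} |g \circ T_{x+iz}| = 0 \qquad \mathbb{P}\text{-a.s.} \]

Similarly, a function $g$ on $\Omega_\ell$ is a member of $\mathcal{L}$ if $g(\cdot\,, z_{1,\ell}) \in \mathcal{L}$ for each $z_{1,\ell} \in \mathcal{R}^\ell$.

A bounded $g$ is in $\mathcal{L}$ under an arbitrary ergodic $\mathbb{P}$, and so is any $g \in L^1(\mathbb{P})$ if $d = 1$. In general there is a trade-off between the degree of mixing in $\mathbb{P}$ and the moment of $g$ required. For example, if sufficiently separated shifts of $g$ are i.i.d. or there is exponential mixing, then $g \in L^p(\mathbb{P})$ for some $p > d$ guarantees $g \in \mathcal{L}$. Under polynomial mixing a higher moment is needed. Lemma A.4 in Appendix A collects sufficient conditions for membership in $\mathcal{L}$.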

We have two variational formulas for the free energy. One is duality in terms of entropy. The other involves a functional $K_\ell(g)$ defined by a minimization over gradient-like auxiliary functions. Class $\mathcal{K}_\ell$ below is a generalization of a class of functions previously introduced by [35].

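Definition 2.2. A measurable function $F : \Omega_\ell \times \mathcal{R} \to \mathbb{R}$ is in class $\mathcal{K}_\ell$ if it satisfies the following three conditions.
(i) Integrability: for each $z_{1,\ell} \in \mathcal{R}^\ell$ and $z \in \mathcal{R}$, $\mathbb{E}[|F(\omega, z_{1,\ell}, z)|] < \infty$.

(ii) Mean zero: for all $n \ge \ell$ and $\{a_i\}_{i=1}^n \in \mathcal{R}^n$ the following holds. If $\eta_0 = (\omega, a_{n-\ell+1,n})$ and $\eta_i = S^+_{a_i}\eta_{i-1}$ for $i = 1, \dots, n$, then

\[ \mathbb{E}\Big[ \sum_{i=0}^{n-1} F(\eta_i, a_{i+1}) \Big] = 0. \]

In other words, expectation vanishes whenever the sequence of moves $S^+_{a_1}, \dots, S^+_{a_n}$ takes $(\omega, z_{1,\ell})$ to $(T_x\omega, z_{1,\ell})$ for all $\omega$, for fixed $x$ and $z_{1,\ell}$.
(iii) Closed loop: for $\mathbb{P}$-a.e. $\omega$ and any two paths $\{\eta_i\}_{i=0}^{n}$ and $\{\bar\eta_j\}_{j=0}^{m}$ with $\eta_0 = \bar\eta_0$, $\eta_n = \bar\eta_m$, $\eta_i = S^+_{a_i}\eta_{i-1}$, and $\bar\eta_j = S^+_{\bar a_j}\bar\eta_{j-1}$, for $i, j > 0$ and some $\{a_i\}_{i=1}^n \in \mathcal{R}^n$ and $\{\bar a_j\}_{j=1}^m \in \mathcal{R}^m$, we have

\[ \sum_{i=0}^{n-1} F(\eta_i, a_{i+1}) = \sum_{j=0}^{m-1} F(\bar\eta_j, \bar a_{j+1}). \]

In case of a loop ($\eta_0 = \eta_n$) in (iii) above one can take $m = 0$ and the right-hand side in the display vanishes. The simplest members of $\mathcal{K}_\ell$ are gradients $F(\eta, z) = h(S_z^+\eta) - h(\eta)$ with bounded measurable $h : \Omega_\ell \to \mathbb{R}$. Lemma C.3 in the appendix shows that $\mathcal{K}_\ell$ is the $L^1(\mathbb{P})$-closure of such gradients.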

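For $F \in \mathcal{K}_\ell$ and $g : \Omega_\ell \to \mathbb{R}$ such that $g(\cdot\,, z_{1,\ell}) \in L^1(\mathbb{P})$ for all $z_{1,\ell} \in \mathcal{R}^\ell$ define

\[ K_{\ell,F}(g) = \mathbb{P}\text{-}\operatorname*{ess\,sup}_{\omega}\, \max_{z_{1,\ell} \in \mathcal{R}^\ell}\, \log \sum_{z \in \mathcal{R}} \frac{1}{|\mathcal{R}|}\, e^{g(\eta) + F(\eta, z)} \]

and then

\[ K_\ell(g) = \inf_{F \in \mathcal{K}_\ell} K_{\ell,F}(g). \]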



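The reference walk $P_0$ with uniform steps from $\mathcal{R}$ defines a Markov kernel $p_\ell$ on $\Omega_\ell$ by

\[ p_\ell(\eta, S_z^+\eta) = \frac{1}{|\mathcal{R}|} \qquad \text{for } z \in \mathcal{R} \text{ and } \eta = (\omega, z_{1,\ell}) \in \Omega_\ell. \tag{2.2} \]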



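Let $\mu_0$ denote the $\Omega$-marginal of a measure $\mu \in \mathcal{M}_1(\Omega_\ell)$. Define an entropy $H_{\ell,\mathbb{P}}$ on $\mathcal{M}_1(\Omega_\ell)$ by

\[ H_{\ell,\mathbb{P}}(\mu) = \begin{cases} \inf\{H(\mu \times q \mid \mu \times p_\ell) : q \in \mathcal{Q}(\Omega_\ell) \text{ with } \mu q = \mu\} & \text{if } \mu_0 \ll \mathbb{P}, \\ \infty & \text{otherwise.} \end{cases} \tag{2.3} \]

Inside the braces the familiar relative entropy is

\[ H(\mu \times q \mid \mu \times p_\ell) = \int \sum_{z \in \mathcal{R}} q(\eta, S_z^+\eta)\, \log \frac{q(\eta, S_z^+\eta)}{p_\ell(\eta, S_z^+\eta)}\, \mu(d\eta). \]

$H_{\ell,\mathbb{P}} : \mathcal{M}_1(\Omega_\ell) \to [0,\infty]$ is convex. (The argument for this can be found at the end of Section 4 in [32].) For measurable functions $g$ on $\Omega_\ell$ define

\[ H^{\#}_{\ell,\mathbb{P}}(g) = \sup_{\mu \in \mathcal{M}_1(\Omega_\ell),\, c > 0} \big\{ E^\mu[\min(g, c)] - H_{\ell,\mathbb{P}}(\mu) \big\}. \tag{2.4} \]

For $g$ from the space of bounded measurable functions (or bounded continuous functions if $\Omega$ comes with a metric) $H^{\#}_{\ell,\mathbb{P}}(g)$ is the convex dual of $H_{\ell,\mathbb{P}}$, and then we write $H^{*}_{\ell,\mathbb{P}}(g)$. The constant $R = \max\{|z| : z \in \mathcal{R}\}$ appears also frequently in the results.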

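For the rest of the section we fix $\ell \ge 0$ and consider measurable functions $g : \Omega_\ell \to \mathbb{R}$.

Theorem 2.3. Assume $\mathfrak{S}$ is countably generated. Let $g \in \mathcal{L}$. Then, for $\mathbb{P}$-a.e. $\omega$ the limit

\[ \Lambda_\ell(g) = \lim_{n\to\infty} n^{-1} \log E_0\Big[ e^{\sum_{k=0}^{n-1} g(T_{X_k}\omega,\, Z_{k+1,k+\ell})} \Big] \]

exists, is deterministic, and satisfies $\Lambda_\ell(g) = K_\ell(g) = H^{\#}_{\ell,\mathbb{P}}(g)$.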
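To see the identity $\Lambda_\ell(g) = K_\ell(g)$ at work in the simplest possible setting, consider a toy case that is not discussed in the text: $\ell = 0$, $\mathcal{R} = \{+1, -1\}$, and an environment that is periodic with odd period $N$, so that $\Omega$ can be identified with $\mathbb{Z}/N\mathbb{Z}$ with uniform $\mathbb{P}$. Then $E_0[e^{nR_n^0(g)}]$ is an $n$-fold application of the matrix $A(i,j) = \tfrac12 e^{g(i)} 1\{j = i \pm 1\}$, and by the Perron-Frobenius theorem a positive eigenvector $v$ yields the gradient $F(i,z) = h(i+z) - h(i)$ with $h = \log v$, for which the quantity inside the essential supremum defining $K_{0,F}(g)$ is constant. The sketch below checks this numerically. All function names are hypothetical, and power iteration is used only as a convenient way to approximate $v$.

```python
import math

def apply_kernel(g, v):
    """(A v)(i) = e^{g(i)} * (v(i+1) + v(i-1)) / 2 on the cycle Z/N."""
    n_sites = len(g)
    return [
        math.exp(g[i]) * 0.5 * (v[(i + 1) % n_sites] + v[(i - 1) % n_sites])
        for i in range(n_sites)
    ]

def free_energy_finite_n(g, n):
    """n^{-1} log E_0[exp(sum_{k<n} g(T_{X_k} omega))] with omega identified with site 0."""
    v = [1.0] * len(g)
    log_scale = 0.0
    for _ in range(n):
        v = apply_kernel(g, v)
        s = max(v)
        v = [a / s for a in v]  # rescale to avoid overflow
        log_scale += math.log(s)
    return (log_scale + math.log(v[0])) / n

def perron_corrector(g, iterations=5000):
    """Approximate h = log(Perron eigenvector of A) by power iteration."""
    v = [1.0] * len(g)
    for _ in range(iterations):
        v = apply_kernel(g, v)
        s = max(v)
        v = [a / s for a in v]
    return [math.log(a) for a in v]

def K_gradient(g, h):
    """max_i log sum_z (1/2) exp(g(i) + h(i+z) - h(i)), i.e. K_{0,F}(g) for F = gradient of h."""
    n_sites = len(g)
    return max(
        math.log(sum(0.5 * math.exp(g[i] + h[(i + z) % n_sites] - h[i]) for z in (1, -1)))
        for i in range(n_sites)
    )
```

With the trivial choice $h \equiv 0$ the same function returns $\max_i g(i)$, which is an upper bound for the free energy but in general not the infimum.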
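Remark 2.4. The limit $\Lambda_\ell(g)$ satisfies these bounds:

\[ \mathbb{E}\Big[ \min_{z_{1,\ell} \in \mathcal{R}^\ell} g(\omega, z_{1,\ell}) \Big] \le \Lambda_\ell(g) \le \lim_{n\to\infty}\, \max_{\substack{x_0 = 0,\ x_k - x_{k-1} \in \mathcal{R} \\ 1 \le k < n}}\, n^{-1} \sum_{k=0}^{n-1}\, \max_{z_{1,\ell} \in \mathcal{R}^\ell} g(T_{x_k}\omega, z_{1,\ell}). \tag{2.5} \]

The upper bound is nonrandom by invariance. The lower bound comes from ergodicity of the Markov chain $T_{X_n}\omega$ (Lemma 4.1 in [32]) and Jensen's inequality:

\[ n^{-1} \log E_0\Big[ e^{\sum_{k=0}^{n-1} g(T_{X_k}\omega,\, Z_{k+1,k+\ell})} \Big] \ge n^{-1} \sum_{k=0}^{n-1} E_0\Big[ \min_{z_{1,\ell} \in \mathcal{R}^\ell} g(T_{X_k}\omega, z_{1,\ell}) \Big]. \]

If $g$ is unbounded from above and $\mathcal{R}$ allows the walk to revisit sites then a situation where $\Lambda_\ell(g) = \infty$ can be easily created. Under some independence and moment assumptions the limit on the right in (2.5) is known to be a.s. finite.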

Remark 2.5. Suppose $\Omega$ is a product space with i.i.d. coordinates $\{\omega_x\}$ under $\mathbb{P}$, the walk is strictly directed ($0$ does not lie in the convex hull of $\mathcal{R}$), and $g(\cdot\,, z_{1,\ell})$ is a local function on $\Omega$. Then the assumption $\mathbb{E}|g(\cdot\,, z_{1,\ell})|^p < \infty$, for some $p > d$ and all $z_{1,\ell} \in \mathcal{R}^\ell$, is sufficient for the above Theorem 2.3 and the finiteness of the limit $\Lambda_\ell(g)$. That such $g \in \mathcal{L}$ is proved in Lemma A.4 in Appendix A. Under this moment bound, finiteness of the upper bound in (2.5) follows from lattice animal bounds [10, 19, 30].

Remark 2.6. If $\mathcal{R} = \{\pm e_1, \dots, \pm e_d\}$ or $\mathcal{R} = \{e_1 \pm e_2, \dots, e_1 \pm e_d\}$ and if we take $g$ to be a function of $\omega$ only, then $\Lambda_0(g) = K_0(g) = H^{*}_{0,\mathbb{P}}(g)$ corresponds to a discretization of the variational formula for the effective Hamiltonian $\bar{H}$ of the homogenized stochastic Hamilton-Jacobi-Bellman equation considered in [27], [28], and [29]. It is also related to the variational formula for the exponential decay rate of the Green's function of Brownian motion in a periodic potential; see (1.1) in [38].

Here is an outline of the proof of Theorem 2.3. Introduce the empirical measure $R_n^\ell = n^{-1} \sum_{k=0}^{n-1} \delta_{T_{X_k}\omega,\, Z_{k+1,k+\ell}}$ so that $nR_n^\ell(g) = \sum_{k=0}^{n-1} g(T_{X_k}\omega, Z_{k+1,k+\ell})$ gives convenient compact notation for the sum in the exponent. Let

\[ \underline{\Lambda}_\ell(g, \omega) = \varliminf_{n\to\infty} n^{-1} \log E_0\big[ e^{nR_n^\ell(g)} \big] \quad \text{and} \quad \overline{\Lambda}_\ell(g, \omega) = \varlimsup_{n\to\infty} n^{-1} \log E_0\big[ e^{nR_n^\ell(g)} \big]. \]

The existence of $\Lambda_\ell(g)$ and the variational formulas are established through the inequalities

\[ \overline{\Lambda}_\ell(g) \overset{\text{(i)}}{\le} K_\ell(g) \overset{\text{(ii)}}{\le} H^{\#}_{\ell,\mathbb{P}}(g) \overset{\text{(iii)}}{\le} \underline{\Lambda}_\ell(g). \tag{2.6} \]

Inequality (2.6.i) is proved in Lemma 2.11. This is the only step that requires $g \in \mathcal{L}$ rather than just $L^1(\mathbb{P})$. Inequality (2.6.ii) is proved in Lemma 2.12. This is where the main technical effort of the paper lies, in order to relax the irreducibility assumption on $\mathcal{R}$ used in [32]. Bound (2.6.iii) is proved with the usual change of measure argument. It follows as a special case from Lemma 2.15 below. The proof of Theorem 2.3 comes at the end of this section after the lemmas. To improve the readability of this section some lemmas are proved in an appendix at the end of the paper.

Remark 2.7. Suppose $0$ lies in the relative interior of the convex hull of $\mathcal{R}$. Then for every $x \in \mathcal{G}$ there exists $z_{1,n} \in \mathcal{R}^n$ with $x_n = x$ (Corollary A.3). Under this irreducibility the approach of [32] becomes available and can be used to prove our results under the assumption that $g(\cdot\,, z_{1,\ell}) \in L^p(\mathbb{P})$ for some $p > d$ and all $z_{1,\ell} \in \mathcal{R}^\ell$. In this case (2.6.i) is proved via a slight variation of Lemma 5.2 of [32] rather than our Lemma 2.11. This relies crucially on Lemma 5.1 of [32] which is where $p > d$ moments are required. We replace this with the much weaker Lemma 2.9 which only requires one moment, but then we need Lemma 2.11 which requires $g \in \mathcal{L}$.

We turn to developing inequalities (2.6). Decomposing the free energy according to asymptotic directions $\xi$ turns out to be useful. Let $\mathcal{U}$ be the (compact) convex hull of $\mathcal{R}$ in $\mathbb{R}^d$. For each rational point $\xi \in \mathcal{U}$ fix a positive integer $b(\xi)$ such that $b(\xi)\xi \in D_{b(\xi)}$ (recall definition (2.1) of $D_n$). The existence of $b(\xi)$ follows from Lemma A.1 in Appendix A. Then fix a path $\{\hat{x}_n(\xi)\}_{n \in \mathbb{Z}_+}$, starting at $\hat{x}_0(\xi) = 0$, with admissible steps $\hat{x}_n(\xi) - \hat{x}_{n-1}(\xi) \in \mathcal{R}$ and such that $\hat{x}_{jb(\xi)}(\xi) = jb(\xi)\xi$ for all $j \in \mathbb{Z}_+$. Even though stationarity and ergodicity are standing assumptions in this section, the next lemma actually needs no assumptions on $\mathbb{P}$.



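Lemma 2.8. Let $g \in \mathcal{L}$. Then for $\mathbb{P}$-a.e. $\omega$

\[ \varlimsup_{n\to\infty} n^{-1} \log E_0\big[ e^{nR_n^\ell(g)} \big] \le \sup_{\xi \in \mathbb{Q}^d \cap \mathcal{U}}\, \varlimsup_{n\to\infty} n^{-1} \log E_0\big[ e^{nR_n^\ell(g)}\, 1\{X_n = \hat{x}_n(\xi)\} \big]. \tag{2.7} \]

Proof. Fix a small $\varepsilon > 0$, an integer $k \ge |\mathcal{R}|\varepsilon^{-1}$, and a nonzero $\hat{z} \in \mathcal{R}$. For $x \in D_n$ write $x = \sum_{z \in \mathcal{R}} a_z z$ with $a_z \in \mathbb{Z}_+$ and $\sum_{z \in \mathcal{R}} a_z = n$. Let $m_n = \lceil n/(k(1-2\varepsilon)) \rceil$ and $s_z^{(n)} = \lceil k(1-2\varepsilon)a_z/n \rceil$. Then $k^{-1} \sum_z s_z^{(n)} \le 1 - \varepsilon$ and $m_n s_z^{(n)} \ge a_z$ for each $z \in \mathcal{R}$. Let

\[ \xi(n,x) = k^{-1} \sum_{z \in \mathcal{R}} s_z^{(n)} z + \Big( 1 - k^{-1} \sum_{z \in \mathcal{R}} s_z^{(n)} \Big) \hat{z}. \tag{2.8} \]

Then $\xi(n,x) \in k^{-1}D_k$. With $\varepsilon$ fixed small enough and considering $n \ge k/\varepsilon$, we constructed an admissible path of $m_n k - n \le 4n\varepsilon$ steps from $x$ to $m_n k\, \xi(n,x)$. This path has at least $m_n(k - \sum_z s_z^{(n)}) \ge m_n k\varepsilon \ge n\varepsilon/(1-2\varepsilon)$ $\hat{z}$-steps. Consequently at least a fixed fraction $\delta$ of the steps of the path are $\hat{z}$-steps, for all $x \in D_n$ and all $n$.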

Let $b$ be the least common multiple of the (finitely many) integers $\{b(\xi) : \xi \in k^{-1}D_k\}$. Now we take another bounded number of additional steps to get from $m_n k\, \xi(n,x)$ to the path $\hat{x}_\cdot(\xi(n,x))$. Pick $\ell_n$ such that $(\ell_n - 1)b < m_n \le \ell_n b$. Then by repeating the steps of $k\xi(n,x)$ in (2.8) $\ell_n b - m_n < b$ times, we go from $m_n k\, \xi(n,x)$ to $\ell_n kb\, \xi(n,x) = \hat{x}_{\ell_n kb}(\xi(n,x))$. The duration of this last leg is bounded independently of $n$ and $x \in D_n$ because $k$ was fixed at the outset and $b$ is determined by $k$. Thus the total number of steps from $x \in D_n$ to $\hat{x}_{\ell_n kb}(\xi(n,x))$ is $r_n = \ell_n kb - n \le 5n\varepsilon$, for large enough $n$. Let $u(n,x) = (u_1, \dots, u_{r_n})$ denote this sequence of steps. Again we note that at least a fixed fraction $\delta$ of the $u_i$'s are $\hat{z}$-steps.

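Develop an estimate:

\begin{align*}
\frac{1}{n} \log E_0\big[ e^{nR_n^\ell(g)} \big] &= \frac{1}{n} \log \sum_{x \in D_n} E_0\big[ e^{nR_n^\ell(g)},\, X_n = x \big] \\
&\le \max_{x \in D_n} \frac{1}{n} \log E_0\big[ e^{(n-\ell)R_{n-\ell}^\ell(g)},\, X_n = x \big] + \max_{w \in D_{n-\ell}}\, \max_{y \in \bigcup_{s=0}^{\ell} D_s} \frac{\ell}{n}\, \bar{g}(T_{w+y}\omega) + \frac{C \log n}{n} \\
&\le \max_{x \in D_n} \frac{1}{n} \log E_0\big[ e^{\ell_n kb\, R_{\ell_n kb}^\ell(g)},\, X_{\ell_n kb} = \hat{x}_{\ell_n kb}(\xi(n,x)) \big] + \frac{C \log n}{n} \\
&\qquad + \max_{w \in D_{n-\ell}}\, \max_{y \in \bigcup_{s=0}^{\ell} D_s} \frac{2\ell}{n}\, \bar{g}(T_{w+y}\omega) + \max_{x \in D_n} \frac{1}{n} \sum_{i=1}^{r_n} \bar{g}(T_{x+u_1+\dots+u_i}\omega) + \frac{r_n}{n} \log |\mathcal{R}|.
\end{align*}

Above, $\bar{g}(\omega) = \max_{z_{1,\ell} \in \mathcal{R}^\ell} |g(\omega, z_{1,\ell})|$. The second-to-last line of the above display is bounded above by

\[ \max_{\xi \in k^{-1}D_k} \frac{1}{n} \log E_0\big[ e^{\ell_n kb\, R_{\ell_n kb}^\ell(g)},\, X_{\ell_n kb} = \hat{x}_{\ell_n kb}(\xi) \big] + \frac{C \log n}{n} \]

and so its limsup is almost surely at most

\[ (1 + 5\varepsilon) \sup_{\xi \in \mathbb{Q}^d \cap \mathcal{U}}\, \varlimsup_{n\to\infty} n^{-1} \log E_0\big[ e^{nR_n^\ell(g)}\, 1\{X_n = \hat{x}_n(\xi)\} \big]. \]

The proof of (2.7) is complete once we show that a.s.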

\begin{equation}
\begin{aligned}
&\lim_{\varepsilon \to 0}\, \varlimsup_{n\to\infty}\, \max_{x \in D_n} \frac{1}{n} \sum_{i=1}^{r_n} \bar{g}(T_{x+u_1+\dots+u_i}\omega) = 0 \\
\text{and} \quad &\lim_{\varepsilon \to 0}\, \varlimsup_{n\to\infty}\, \max_{w \in D_{n-\ell}}\, \max_{y \in \bigcup_{s=0}^{\ell} D_s} \frac{1}{n}\, \bar{g}(T_{w+y}\omega) = 0.
\end{aligned}
\tag{2.9}
\end{equation}

To this end, observe that the ordering of the steps of $u(n,x)$ was so far immaterial. Because Definition 2.1 cannot handle zero steps, we need to be careful about them. The ratio of zero steps to $\hat{z}$-steps is at most $t = \lceil \delta^{-1} \rceil$. We begin $u(n,x)$ by alternating $\hat{z}$-steps with blocks of at most $t$ zero steps, until the $\hat{z}$-steps and the zero steps are exhausted. After that order the remaining nonzero steps of $\mathcal{R}$ in any fashion $z_1, z_2, \dots$, and have $u(n,x)$ take first all its $z_1$ steps, then all its $z_2$ steps, and so on. Since zero steps do not shift $\omega$ but simply repeat the same $\bar{g}$-value at most $t$ times, we get the bound

\[ \sum_{i=1}^{r_n} \bar{g}(T_{x+u_1+\dots+u_i}\omega) \le t\, |\mathcal{R}| \max_{y \in x + u(n,x)}\, \max_{z \in \mathcal{R} \setminus \{0\}}\, \sum_{i=0}^{r_n} \bar{g}(T_{y+iz}\omega). \]

By $y \in x + u(n,x)$ we mean $y$ is on the path starting from $x$ and taking steps in $u(n,x)$. A similar bound develops for the second line of (2.9), and the limits in (2.9) follow from membership in $\mathcal{L}$. □

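The next step is to show (2.6.i): for $g \in \mathcal{L}$ and $\mathbb{P}$-a.e. $\omega$, $\overline{\Lambda}_\ell(g, \omega) \le K_\ell(g)$. The following ergodic property is crucial. Recall the definition of the path $\hat{x}_\cdot(\xi)$ above Lemma 2.8. For $\xi \in \mathbb{Q}^d \cap \mathcal{U}$ and $z_{1,\ell}, \bar{z}_{1,\ell} \in \mathcal{R}^\ell$ define

\[ \mathcal{A}_n(\xi, z_{1,\ell}, \bar{z}_{1,\ell}) = \big\{ (a_1, \dots, a_n) \in \mathcal{R}^n : z_1 + \dots + z_\ell + a_1 + \dots + a_{n-\ell} = \hat{x}_n(\xi),\ a_{n-\ell+1,n} = \bar{z}_{1,\ell} \big\}. \]

This is the set of steps $(a_1, \dots, a_n)$ that take $\eta_0 = (\omega, z_{1,\ell})$ to $\eta_n = (T_{\hat{x}_n(\xi)}\omega, \bar{z}_{1,\ell})$ via $\eta_i = S^+_{a_i}\eta_{i-1}$, $1 \le i \le n$.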

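Lemma 2.9. Let $F \in \mathcal{K}_\ell$. Then, for each $\xi \in \mathbb{Q}^d \cap \mathcal{U}$ and $z_{1,\ell}, \bar{z}_{1,\ell} \in \mathcal{R}^\ell$,

\[ \lim_{n\to\infty}\, \max_{(a_1, \dots, a_n) \in \mathcal{A}_n(\xi, z_{1,\ell}, \bar{z}_{1,\ell})}\, \Big| \frac{1}{n} \sum_{i=0}^{n-1} F(\eta_i, a_{i+1}) \Big| = 0 \qquad \text{in } L^1(\mathbb{P}) \text{ and for } \mathbb{P}\text{-a.e. } \omega. \]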



Remark 2.10. Due to the closed loop property (iii) in Definition 2.2, the sum above is independent of $(a_1, \dots, a_n) \in \mathcal{A}_n(\xi, z_{1,\ell}, \bar{z}_{1,\ell})$. In other words, there actually is no maximum. Also, Lemma 2.9 holds regardless of the choices made in the definition of $\hat{x}_\cdot(\xi)$.

We postpone the proof of Lemma 2.9 to Appendix C.

Lemma 2.11. Let $g \in \mathcal{L}$. Then $\overline{\Lambda}_\ell(g, \omega) \le K_\ell(g)$ for $\mathbb{P}$-a.e. $\omega$.

Proof. By Lemma 2.8 it is enough to show that



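\[ \varlimsup_{n\to\infty} n^{-1} \log E_0\big[ e^{nR_n^\ell(g)}\, 1\{X_n = \hat{x}_n(\xi)\} \big] \le K_{\ell,F}(g) \qquad \mathbb{P}\text{-a.s.} \]

for fixed $\xi \in \mathbb{Q}^d \cap \mathcal{U}$ and $F \in \mathcal{K}_\ell$. Abbreviate $\eta_k = (T_{X_k}\omega, Z_{k+1,k+\ell})$. Fix $\varepsilon > 0$. Lemma 2.9 implies that for $\mathbb{P}$-a.e. $\omega$ there exists a finite $c_\varepsilon(\omega)$ such that for all $n$, on the event $\{X_n = \hat{x}_n(\xi)\}$,

\[ \sum_{k=0}^{n-1} F(\eta_k, Z_{k+\ell+1}) \ge -c_\varepsilon - n\varepsilon. \]

Therefore, for $\mathbb{P}$-a.e. $\omega$

\begin{align*}
-n^{-1}c_\varepsilon - \varepsilon + n^{-1} \log E_0\big[ e^{nR_n^\ell(g)}\, 1\{X_n = \hat{x}_n(\xi)\} \big]
&\le n^{-1} \log E_0\Big[ \exp\Big\{ \sum_{k=0}^{n-1} \big( g(\eta_k) + F(\eta_k, Z_{k+\ell+1}) \big) \Big\}\, 1\{X_n = \hat{x}_n(\xi)\} \Big] \\
&\le n^{-1} \log E_0\Big[ \exp\Big\{ \sum_{k=0}^{n-1} \big( g(\eta_k) + F(\eta_k, Z_{k+\ell+1}) \big) \Big\} \Big] \\
&= n^{-1} \log E_0\Big[ \exp\Big\{ \sum_{k=0}^{n-2} \big( g(\eta_k) + F(\eta_k, Z_{k+\ell+1}) \big) \Big\} \sum_{z \in \mathcal{R}} |\mathcal{R}|^{-1} e^{g(\eta_{n-1}) + F(\eta_{n-1}, z)} \Big] \\
&\le n^{-1} K_{\ell,F}(g) + n^{-1} \log E_0\Big[ \exp\Big\{ \sum_{k=0}^{n-2} \big( g(\eta_k) + F(\eta_k, Z_{k+\ell+1}) \big) \Big\} \Big] \\
&\le \dots \le K_{\ell,F}(g).
\end{align*}

The claim follows by taking $n \nearrow \infty$ and then $\varepsilon \searrow 0$. □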



We have shown (2.6.i) and next in line is (2.6.ii). The following lemma is the most laborious step in the paper.

Lemma 2.12. In addition to ergodicity assume now that $\mathfrak{S}$ is countably generated. Assume $g(\cdot\,, z_{1,\ell}) \in L^1(\mathbb{P})$ is bounded above. Then

\[ K_\ell(g) \le H^{\#}_{\ell,\mathbb{P}}(g) = \sup_{\mu \in \mathcal{M}_1(\Omega_\ell)} \big\{ E^\mu[g] - H_{\ell,\mathbb{P}}(\mu) \big\}. \]

Proof. We can assume $H^{\#}_{\ell,\mathbb{P}}(g) < \infty$. The first technical issue is to find some compactness to control the supremum on the right. Assuming $\Omega$ compact would not be helpful because the problem is the absolute continuity condition in the definition of $H_{\ell,\mathbb{P}}(\mu)$.

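Fix a sequence of increasing finite algebras $\mathfrak{S}_k$ on $\Omega$ that satisfy $T_{\pm z}\mathfrak{S}_{k-1} \subset \mathfrak{S}_k$ for all $k \in \mathbb{N}$ and $z \in \mathcal{R}$ and whose union generates $\mathfrak{S}$. Let $\mathcal{M}_1^k = \mathcal{M}_1^k(\Omega_\ell)$ be the set of probability measures $\mu$ on $\Omega_\ell$ for which there exist $\mathfrak{S}_k$-measurable Radon-Nikodym derivatives $\phi_{z_{1,\ell}}$ on $\Omega$ (with respect to $\mathbb{P}$) such that for bounded measurable $G$

\[ \int_{\Omega_\ell} G\, d\mu = \sum_{z_{1,\ell} \in \mathcal{R}^\ell} \int_\Omega \phi_{z_{1,\ell}}(\omega)\, G(\omega, z_{1,\ell})\, \mathbb{P}(d\omega). \]

Such $\mu$ satisfy $\mu_0 \ll \mathbb{P}$ and so

\[ H^{\#}_{\ell,\mathbb{P}}(g) = \sup_{\mu \in \mathcal{M}_1(\Omega_\ell)} \big\{ E^\mu[g] - H_{\ell,\mathbb{P}}(\mu) \big\} \ge \sup_{\mu \in \mathcal{M}_1^k(\Omega_\ell)} \big\{ E^\mu[g] - H_{\ell,\mathbb{P}}(\mu) \big\}. \]

Abbreviate $A = H^{\#}_{\ell,\mathbb{P}}(g)$. The proof is completed by verifying this statement:

\[ \text{if } A \ge \sup_{\mu \in \mathcal{M}_1^k(\Omega_\ell)} \big\{ E^\mu[g] - H_{\ell,\mathbb{P}}(\mu) \big\} \text{ for all } k \ge 1, \text{ then } A \ge K_\ell(g). \tag{2.10} \]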

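Let $\alpha$ denote a generic probability measure on $\Omega_\ell^2$ with marginals $\alpha_1$ and $\alpha_2$ and let $b\Omega_\ell$ denote the space of bounded measurable functions on $\Omega_\ell$.

\begin{align*}
A &\ge \sup_{\mu \in \mathcal{M}_1^k,\ q :\, \mu q = \mu} \big\{ E^\mu[g] - H(\mu \times q \mid \mu \times p_\ell) \big\} \\
&= \sup_{\alpha :\, \alpha_1 \in \mathcal{M}_1^k,\ \alpha_1 = \alpha_2} \big\{ E^{\alpha_1}[g] - H(\alpha \mid \alpha_1 \times p_\ell) \big\} \\
&= \sup_{\alpha :\, \alpha_1 \in \mathcal{M}_1^k}\, \inf_{h \in b\Omega_\ell} \big\{ E^{\alpha_1}[g] + E^{\alpha_2}[h] - E^{\alpha_1}[h] - H(\alpha \mid \alpha_1 \times p_\ell) \big\}. \tag{2.11}
\end{align*}

Let $F$ denote a bounded measurable test function on $\Omega_\ell^2$. $\mathcal{A}^k$ is the set of probability measures $\alpha$ on $\Omega_\ell^2$ of the form

\[ \int_{\Omega_\ell^2} F\, d\alpha = \sum_{z \in \mathcal{R}} \int_{\Omega_\ell} \alpha_1(d\eta)\, q(\eta, S_z^+\eta)\, F(\eta, S_z^+\eta) \]

where $\alpha_1 \in \mathcal{M}_1^k$ and kernel $q(\eta, S_z^+\eta) = q((\omega, z_{1,\ell}), (T_{z_1}\omega, z_{2,\ell}z))$ is $\mathfrak{S}_k$-measurable as a function of $\omega$ for each fixed $(z_{1,\ell}, z)$. A measure $\alpha \in \mathcal{A}^k$ is uniquely represented by a finite sequence $(\phi_{i,z_{1,\ell}}, q_{i,z_{1,\ell},z})$ via the identity

\[ \int F\, d\alpha = \sum_{i, z_{1,\ell}, z} \phi_{i,z_{1,\ell}}\, q_{i,z_{1,\ell},z} \int_{A_i} F\big( (\omega, z_{1,\ell}), (T_{z_1}\omega, z_{2,\ell}z) \big)\, \mathbb{P}(d\omega) \tag{2.12} \]

where $\{A_i\}$ is the finite set of atoms of $\mathfrak{S}_k$ such that $\mathbb{P}(A_i) > 0$, $\phi_{i,z_{1,\ell}}$ is the value of $\phi_{z_{1,\ell}}(\omega)$ for $\omega \in A_i$, and $q_{i,z_{1,\ell},z}$ is the value of $q(\eta, S_z^+\eta)$ for $\omega \in A_i$. Thus $\mathcal{A}^k$ is in bijective correspondence with a compact subset of a Euclidean space, and (2.12) shows that via this identification the integral is continuous in $\alpha$ for any $F$ that is suitably integrable under $\mathbb{P}$. Similarly the entropy

\[ H(\alpha \mid \alpha_1 \times p_\ell) = \sum_{i, z_{1,\ell}, z} \mathbb{P}(A_i)\, \phi_{i,z_{1,\ell}}\, q_{i,z_{1,\ell},z}\, \log\big( |\mathcal{R}|\, q_{i,z_{1,\ell},z} \big) \]

is continuous and convex in $\alpha$.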

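We now turn our attention back to (2.11). Once we restrict $\alpha$ to the compact Hausdorff space $\mathcal{A}^k$ the expression in braces is upper semicontinuous and concave in $\alpha$ and convex in $h$. We can apply König's minimax theorem ([26] or [31]), and continue as follows:

\begin{align*}
A &\ge \sup_{\alpha \in \mathcal{A}^k}\, \inf_{h \in b\Omega_\ell} \big\{ E^{\alpha_1}[g] + E^{\alpha_2}[h] - E^{\alpha_1}[h] - H(\alpha \mid \alpha_1 \times p_\ell) \big\} \\
&= \inf_{h \in b\Omega_\ell}\, \sup_{\alpha \in \mathcal{A}^k} \big\{ E^{\alpha_1}[g] + E^{\alpha_2}[h] - E^{\alpha_1}[h] - H(\alpha \mid \alpha_1 \times p_\ell) \big\} \\
&= \inf_{h \in b\Omega_\ell}\, \sup_{\alpha \in \mathcal{A}^k} \sum_{z_{1,\ell}} \int_\Omega \mathbb{P}(d\omega)\, \phi_{z_{1,\ell}}(\omega) \\
&\qquad \times \Big\{ \sum_{z} q(\eta, S_z^+\eta) \big( g(\eta) - h(\eta) + h(S_z^+\eta) \big) - H\big( q(\eta, \cdot) \mid p_\ell(\eta, \cdot) \big) \Big\} \\
&= \inf_{h \in b\Omega_\ell}\, \sup_{\alpha \in \mathcal{A}^k} \sum_{z_{1,\ell}} \int_\Omega \mathbb{P}(d\omega)\, \phi_{z_{1,\ell}}(\omega) \\
&\qquad \times \Big\{ \sum_{z} q(\eta, S_z^+\eta)\, \mathbb{E}\big[ g(\eta) - h(\eta) + h(S_z^+\eta) \mid \mathfrak{S}_k \big] - H\big( q(\eta, \cdot) \mid p_\ell(\eta, \cdot) \big) \Big\} \\
&= \inf_{h \in b\Omega_\ell}\, \sup_{\mu \in \mathcal{M}_1^k} \sum_{z_{1,\ell}} \int_\Omega \mathbb{P}(d\omega)\, \phi_{z_{1,\ell}}(\omega)\, \log \sum_{z} \frac{1}{|\mathcal{R}|}\, e^{\mathbb{E}[ g(\eta) - h(\eta) + h(S_z^+\eta) \mid \mathfrak{S}_k ]}.
\end{align*}

Above we introduced the densities $\phi_{z_{1,\ell}}(\omega)$ and the kernel $q$ that correspond to $\alpha \in \mathcal{A}^k$, used $\mathfrak{S}_k$-measurability to take conditional expectation, and then took supremum over the kernels $q$ with the first marginal $\alpha_1 = \mu$ fixed. This supremum is a finite case of the convex duality of relative entropy:

\[ \sup_{q} \Big\{ \sum_z q(z)v(z) - \sum_z q(z) \log \frac{q(z)}{p(z)} \Big\} = \log \sum_z p(z)\, e^{v(z)} \]

and the maximizing probability is $q(z) = \big( \sum_x p(x)e^{v(x)} \big)^{-1} p(z)e^{v(z)}$. In our case $v(z) = \mathbb{E}[g(\eta) - h(\eta) + h(S_z^+\eta) \mid \mathfrak{S}_k]$ so the maximizing kernel is $\mathfrak{S}_k$-measurable in $\omega$ and thus admissible under the condition $\alpha \in \mathcal{A}^k$.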
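The finite convex duality used in this step is easy to check numerically. The sketch below is an illustration only, with hypothetical function names: it evaluates the objective $\sum_z q(z)v(z) - \sum_z q(z)\log(q(z)/p(z))$, the closed-form value $\log \sum_z p(z)e^{v(z)}$, and the tilted maximizer $q(z) \propto p(z)e^{v(z)}$.

```python
import math

def log_partition(p, v):
    """Right-hand side of the duality: log sum_z p(z) e^{v(z)}."""
    return math.log(sum(pz * math.exp(vz) for pz, vz in zip(p, v)))

def tilted_distribution(p, v):
    """The maximizer q(z) = p(z) e^{v(z)} / sum_x p(x) e^{v(x)}."""
    weights = [pz * math.exp(vz) for pz, vz in zip(p, v)]
    total = sum(weights)
    return [w / total for w in weights]

def objective(q, p, v):
    """sum_z q(z) v(z) - sum_z q(z) log(q(z)/p(z)), with the convention 0 log 0 = 0."""
    value = 0.0
    for qz, pz, vz in zip(q, p, v):
        if qz > 0:
            value += qz * vz - qz * math.log(qz / pz)
    return value
```

Evaluating `objective` at `tilted_distribution(p, v)` reproduces `log_partition(p, v)` up to rounding, and any other probability vector `q` gives a value that is no larger.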
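Performing the last supremum over $\mu \in \mathcal{M}_1^k$ gives

\[ A \ge \inf_{h \in b\Omega_\ell}\, \max_{z_{1,\ell} \in \mathcal{R}^\ell}\, \mathbb{P}\text{-}\operatorname*{ess\,sup}_{\omega} \Big\{ \log \sum_{z} \frac{1}{|\mathcal{R}|}\, e^{\mathbb{E}[ g(\eta) - h(\eta) + h(S_z^+\eta) \mid \mathfrak{S}_k ]} \Big\}. \]

Consequently for $\varepsilon > 0$ and $k \ge 1$ there exists a bounded measurable function $h_{k,\varepsilon}$ on $\Omega_\ell$ such that for all $z_{1,\ell} \in \mathcal{R}^\ell$ and $\mathbb{P}$-a.s.

\[ A + \log |\mathcal{R}| + \varepsilon \ge \log \sum_{z} e^{\mathbb{E}[ g(\eta) - h_{k,\varepsilon}(\eta) + h_{k,\varepsilon}(S_z^+\eta) \mid \mathfrak{S}_k ]}. \tag{2.13} \]

For integers $0 \le i \le k$ define

\[ F^{(i)}_{k,\varepsilon}(\eta, z) = \mathbb{E}\big[ h_{k,\varepsilon}(S_z^+\eta) - h_{k,\varepsilon}(\eta) \mid \mathfrak{S}_{k-i} \big]. \tag{2.14} \]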

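We next extract a limit point in $\mathcal{K}_\ell$. The proof of the following lemma is given in Appendix C.
Lemma 2.13. Assume $\mathfrak{S}$ is countably generated and $A < \infty$. Construct $F^{(i)}_{k,\varepsilon}$ as in (2.14). Fix $\ell \ge 0$ and let $g(\omega, z_{1,\ell}) \in L^1(\mathbb{P})$ for all $z_{1,\ell} \in \mathcal{R}^\ell$. Fix $\varepsilon > 0$. Then, as $k \to \infty$, along a subsequence that works simultaneously for all $z_{1,\ell} \in \mathcal{R}^\ell$, $z \in \mathcal{R}$, and $i \ge 0$, one can write



with $\hat{F}^{(i)}_{k,\varepsilon}(\eta, z)$ converging in weak $L^1(\mathbb{P})$ to a limit $\hat{F}^{(i)}_\varepsilon$ and the error terms $\omega \mapsto R^{(i)}_{k,\varepsilon}(\omega, z_{1,\ell}, z) \ge 0$ $\mathfrak{S}_{k-i}$-measurable and converging to $0$ $\mathbb{P}$-a.s. Furthermore, as $i \to \infty$, $\hat{F}^{(i)}_\varepsilon$ converges strongly in $L^1(\mathbb{P})$ to a limit $\hat{F}_\varepsilon$,

\[ c(z) = \mathbb{E}\big[ \hat{F}_\varepsilon(\omega, (z, z, \dots, z), z) \big] \ge 0 \]

for all $z$, and $F_\varepsilon(\eta, z) = \hat{F}_\varepsilon(\eta, z) - c(z_1)$ belongs to class $\mathcal{K}_\ell$.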

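Fix $i\ge0$ for the moment. As a uniformly integrable martingale, $M_k(\eta)=
\mathbb E[g(\eta)\mid\mathfrak S_{k-i}]$ converges as $k\to\infty$ to $g(\eta)$ both a.s. and
in $L^1(\mathbb P)$ for all $z_{1,\ell}$ and $z$.

Fix $z_{1,\ell}$ and $z$. The weak-$L^1(\mathbb P)$ closure of the convex hull of
$\{M_j+\hat F^{(i)}_{j,\varepsilon}: j\ge k\}$ is equal to its strong closure (Theorem 3.12
of [37]). Since $g(\eta)+\hat F^{(i)}_{\varepsilon}(\eta,z)$ is in this closure, there exist
finite convex combinations

\[
G^{(i)}_{k,\varepsilon}=\sum_{j\ge k}\alpha_{j,k}\bigl(M_j+\hat F^{(i)}_{j,\varepsilon}\bigr)
\]

such that

\[
\mathbb E\bigl|g(\eta)+\hat F^{(i)}_{\varepsilon}(\eta,z)-G^{(i)}_{k,\varepsilon}(\eta,z)\bigr|\le 1/k.
\]

Along a subsequence (that we again index by $k$) $G^{(i)}_{k,\varepsilon}(\eta,z)$
converges P-a.s. to $g(\eta)+\hat F^{(i)}_{\varepsilon}(\eta,z)$. Consequently also

\[
\tilde G^{(i)}_{k,\varepsilon}=\sum_{j\ge k}\alpha_{j,k}\bigl(M_j+F^{(i)}_{j,\varepsilon}\bigr)\longrightarrow g(\eta)+\hat F^{(i)}_{\varepsilon}(\eta,z)\quad\mathbb P\text{-a.s.},
\]

because the error terms $R^{(i)}_{j,\varepsilon}=\hat F^{(i)}_{j,\varepsilon}-F^{(i)}_{j,\varepsilon}$ converge to 0 P-a.s.
Along a further subsequence this holds simultaneously for all $z_{1,\ell}$ and $z$.
By (2.13) and Jensen's inequality, we have for all $z_{1,\ell}\in\mathcal R^\ell$ and P-a.s.

\[
e^{A+\log|\mathcal R|+\varepsilon}\ \ge\ \sum_{z}\mathbb E\Bigl[e^{\mathbb E[g(\eta)-h_{k,\varepsilon}(\eta)+h_{k,\varepsilon}(S^+_z\eta)\mid\mathfrak S_k]}\Bigm|\mathfrak S_{k-i}\Bigr]\ \ge\ \sum_{z}e^{M_k(\eta)+F^{(i)}_{k,\varepsilon}(\eta,z)}.
\]

Since this is valid for all $k\ge i$, another application of Jensen's inequality gives

\[
e^{A+\log|\mathcal R|+\varepsilon}\ \ge\ \sum_{z}e^{\tilde G^{(i)}_{k,\varepsilon}(\eta,z)}.
\]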


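Taking $k\to\infty$ implies, for P-a.e. $\omega$ and all $z_{1,\ell}\in\mathcal R^\ell$,

\[
A+\varepsilon\ \ge\ g(\eta)+\log\sum_{z\in\mathcal R}\frac{1}{|\mathcal R|}\,e^{\hat F^{(i)}_{\varepsilon}(\eta,z)}.
\]

Taking $i\to\infty$ implies, for P-a.e. $\omega$ and all $z_{1,\ell}\in\mathcal R^\ell$,

\[
A+\varepsilon\ \ge\ g(\eta)+\log\sum_{z\in\mathcal R}\frac{1}{|\mathcal R|}\,e^{\hat F_{\varepsilon}(\eta,z)}.
\]

Since $c(z_1)\ge0$ the above inequality still holds if $\hat F_\varepsilon$ is replaced
with $F_\varepsilon=\hat F_\varepsilon-c(z_1)$. Thus

\[
A+\varepsilon\ \ge\ \inf_{F\in\mathcal K_\ell}\max_{z_{1,\ell}}\ \mathbb P\text{-}\operatorname{ess\,sup}_{\omega}\Bigl\{g(\eta)+\log\sum_{z}\frac{1}{|\mathcal R|}\,e^{F(\eta,z)}\Bigr\}.
\]

Taking $\varepsilon\to0$ gives $A\ge K_\ell(g)$. (2.10) is verified and thereby the
proof of Lemma 2.12 is complete. □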

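Next, for technical purposes, a Fatou-type lemma for $K_\ell$.

Lemma 2.14. Let $g_k(\cdot,z_{1,\ell})\to g(\cdot,z_{1,\ell})$ in $L^1(\mathbb P)$ for each
$z_{1,\ell}\in\mathcal R^\ell$. Then

\[
K_\ell(g)\le\varliminf_{k\to\infty}K_\ell(g_k).
\]

Proof. We can assume $\varliminf_{k\to\infty}K_\ell(g_k)=A<\infty$. Fix $\varepsilon>0$. There
exists a subsequence, denoted again by $g_k$, such that $K_\ell(g_k)<A+\varepsilon$ for
all $k$. Pick $F_k\in\mathcal K_\ell$ such that

\[
g_k(\eta)+\log\sum_{z}\frac{1}{|\mathcal R|}\,e^{F_k(\eta,z)}\le A+\varepsilon \tag{2.15}
\]

for all $z_{1,\ell}\in\mathcal R^\ell$ and P-a.e. $\omega$. Out of this we can produce an
$F\in\mathcal K_\ell$ such that

\[
g(\eta)+\log\sum_{z\in\mathcal R}\frac{1}{|\mathcal R|}\,e^{F(\eta,z)}\le A+\varepsilon. \tag{2.16}
\]

This implies $K_\ell(g)\le A+\varepsilon$ and taking $\varepsilon\to0$ finishes the proof.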

The construction of $F$ is a simplified version of the argument to realize a
limit point in $\mathcal K_\ell$ in the proof of Lemma 2.12. We sketch the steps.
The reader who aims to master the proof may find it useful to fill in the details.

For each $k$, $z_{1,\ell}$, and $z$,

\[
F_k(\eta,z)\le A+\varepsilon-g_k(\eta)+\log|\mathcal R|.
\]

Thus $F_k^+$ is uniformly integrable. Controlling $F_k^-$ is indirect. Set
$\eta_0=(\omega,z_{1,\ell})$, $z_0=z$, $a_i=z_{i-1}$, and $\eta_i=S^+_{a_i}\eta_{i-1}$ for
$i=1,\dots,\ell+1$. By the mean-zero property of $F_k$ (part (ii) in Definition 2.2),

\[
\mathbb E[F_k^-(\eta,z)]\le\sum_{i=0}^{\ell}\mathbb E[F_k^-(\eta_i,a_{i+1})]=\sum_{i=0}^{\ell}\mathbb E[F_k^+(\eta_i,a_{i+1})]
\]

and so $\mathbb E[F_k^-]$ is bounded uniformly in $k$. Apply Lemma C.4 to write
$F_k^-=\tilde F_k^-+R_k$ such that along a subsequence $\tilde F_k^-$ is uniformly
integrable and $R_k\ge0$ converges to 0 in P-probability, for each $z_{1,\ell}$.
Along a further subsequence $\hat F_k=F_k^+-\tilde F_k^-$ converges weakly in
$L^1(\mathbb P)$ to a limit $\hat F$ and the limits $R_k\to0$ and $g_k\to g$ hold almost surely.

In (2.15) write $F_k=\hat F_k-R_k$. As done in the proof of Lemma 2.12 take almost
surely convergent convex combinations of $\hat F_k$, $R_k$ and $g_k$ and substitute
these into (2.15). Taking the limit now yields (2.16) but with $\hat F$ in place of $F$.

Almost sure convergence of convex combinations ensures that $\hat F$ satisfies the
closed-loop property. But it may fail the mean-zero property. To remedy this,
let $c(z)=\mathbb E[\hat F(\omega,(z,z,\dots,z),z)]$. By the weak convergence $c(z)$ is a
limit of $\mathbb E[\hat F_k(\omega,(z,z,\dots,z),z)]$, which is nonnegative due to $R_k\ge0$
and the mean-zero property for $F_k$. Since $c(z)\ge0$, (2.16) holds with
$F(\eta,z)=\hat F(\eta,z)-c(z_1)$. That $F$ satisfies both the mean-zero and the
closed-loop property is verified with the argument given between equations
(C.11) and (C.12) in Appendix C. The point is that the closed-loop property of
$\hat F$ allows us to define the path integral $f$ which is used in that argument.
This verifies that $F\in\mathcal K_\ell$ and completes the proof. □

Next a large deviation lower bound lemma that gives us (2.6.iii) and serves
again to prove Theorem 3.1 below.

Lemma 2.15. Let $g(\cdot,z_{1,\ell})\in L^1(\mathbb P)$ be bounded above. Then for P-a.e. $\omega$

\[
\varliminf_{n\to\infty}n^{-1}\log E_0\bigl[e^{nR^\ell_n(g)}\bigr]\ \ge\ \sup_{\mu\in\mathcal M_1(\Omega_\ell)}\bigl\{E^{\mu}[g]-H_{\ell,\mathbb P}(\mu)\bigr\}. \tag{2.17}
\]

Assume additionally that $\Omega$ is a separable metric space. Then for P-a.e.
$\omega$ this lower bound holds for all open $O\subset\mathcal M_1(\Omega_\ell)$:

\[
\varliminf_{n\to\infty}n^{-1}\log E_0\bigl[e^{nR^\ell_n(g)}1\{R^\ell_n\in O\}\bigr]\ \ge\ -\inf_{\mu\in O}\bigl\{H_{\ell,\mathbb P}(\mu)-E^{\mu}[g]\bigr\}. \tag{2.18}
\]

Proof. This proof proceeds along the familiar lines of Markov chain lower bound
arguments and we refer to Section 4 of [32] for some further details.

Switch to the $\Omega_\ell$-valued Markov chain $\eta_k=(T_{X_k}\omega,Z_{k+1,k+\ell})$ with
transition kernel $p_\ell$ defined in (2.2). Then $R^\ell_n$ is the position level
empirical measure $L_n=n^{-1}\sum_{k=0}^{n-1}\delta_{\eta_k}$. Denote by $P_\eta$ (with
expectation $E_\eta$) the distribution of the Markov chain $(\eta_k)_{k\ge0}$ with
initial state $\eta$. Starting at $\eta=(\omega,z_{1,\ell})$ is the same as conditioning
our original process on $Z_{1,\ell}=z_{1,\ell}$:

\[
E_0\bigl[G\bigl((T_{X_k}\omega,Z_{k+1,k+\ell})_{0\le k\le n}\bigr)1\{Z_{1,\ell}=z_{1,\ell}\}\bigr]=\frac{1}{|\mathcal R|^{\ell}}\,E_\eta[G(\eta_0,\dots,\eta_n)].
\]

Consequently for any $z_{1,\ell}\in\mathcal R^\ell$ and P-almost every $\omega$, with $\eta=(\omega,z_{1,\ell})$,

\[
\varliminf_{n\to\infty}n^{-1}\log E_0\bigl[e^{nR^\ell_n(g)}1\{R^\ell_n\in O\}\bigr]\ \ge\ \varliminf_{n\to\infty}n^{-1}\log E_\eta\bigl[e^{nL_n(g)}1\{L_n\in O\}\bigr].
\]

Next we reduce the right-hand sides of (2.17) and (2.18) to nice measures. A
convexity argument shows that the supremum/infimum is not altered by restricting
it to measures $\mu$ with these properties: $\mu_0\ll\mathbb P$ and there exists a kernel
$q\in\mathcal Q(\Omega_\ell)$ such that $\mu q=\mu$, $H(\mu\times q\mid\mu\times p_\ell)<\infty$, $q(\eta,\cdot)$ is
supported on shifts $S^+_z\eta$, and $q(\eta,S^+_z\eta)>0$ for all $z\in\mathcal R$ and $\mu$-a.e.
$\eta$. We omit this argument. It can be patterned after the lower bound proof of
Theorem 3.1 of [32] (page 224). This step needs the integrability of
$g(\omega,z_{1,\ell})$ under $\mathbb P$.

These properties of $\mu$ imply the equivalence $\mu_0\sim\mathbb P$ and the ergodicity of
the Markov chain $Q_\eta$ with initial state $\eta$ and transition kernel $q$ (Lemma
4.1 of [32]).

Next follows a standard change of measure argument. Let $\mathcal F_n$ be the
$\sigma$-algebra generated by $(\eta_0,\dots,\eta_n)$. Then



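\[
\begin{aligned}
n^{-1}\log E_\eta\bigl[e^{nL_n(g)}1\{L_n\in O\}\bigr]
&\ge n^{-1}\log\frac{E^{Q_\eta}\Bigl[\Bigl(\frac{dQ_\eta|_{\mathcal F_n}}{dP_\eta|_{\mathcal F_n}}\Bigr)^{-1}e^{nL_n(g)}1\{L_n\in O\}\Bigr]}{Q_\eta\{L_n\in O\}}+n^{-1}\log Q_\eta\{L_n\in O\}\\
&\ge -\,\frac{n^{-1}E^{Q_\eta}\Bigl[\log\frac{dQ_\eta|_{\mathcal F_n}}{dP_\eta|_{\mathcal F_n}}\,1\{L_n\in O\}\Bigr]}{Q_\eta\{L_n\in O\}}+\frac{E^{Q_\eta}[L_n(g)1\{L_n\in O\}]}{Q_\eta\{L_n\in O\}}+n^{-1}\log Q_\eta\{L_n\in O\}\\
&= -\,\frac{n^{-1}E^{Q_\eta}\Bigl[\log\frac{dQ_\eta|_{\mathcal F_n}}{dP_\eta|_{\mathcal F_n}}\Bigr]}{Q_\eta\{L_n\in O\}}+\frac{E^{Q_\eta}[L_n(g)]}{Q_\eta\{L_n\in O\}}+n^{-1}\log Q_\eta\{L_n\in O\}\\
&\qquad+\frac{n^{-1}E_\eta\Bigl[\frac{dQ_\eta|_{\mathcal F_n}}{dP_\eta|_{\mathcal F_n}}\log\frac{dQ_\eta|_{\mathcal F_n}}{dP_\eta|_{\mathcal F_n}}\,1\{L_n\notin O\}\Bigr]}{Q_\eta\{L_n\in O\}}-\frac{E^{Q_\eta}[L_n(g)1\{L_n\notin O\}]}{Q_\eta\{L_n\in O\}}\\
&\ge \frac{E^{Q_\eta}[L_n(g)-L_n(F)]}{Q_\eta\{L_n\in O\}}+n^{-1}\log Q_\eta\{L_n\in O\}\\
&\qquad-\frac{(0\vee\sup g)\,Q_\eta\{L_n\notin O\}}{Q_\eta\{L_n\in O\}}-\frac{n^{-1}e^{-1}}{Q_\eta\{L_n\in O\}},
\end{aligned}
\]

where we used $x\log x\ge-e^{-1}$ and

\[
F(\eta)=\sum_{z\in\mathcal R}q(\eta,S^+_z\eta)\log\frac{q(\eta,S^+_z\eta)}{p_\ell(\eta,S^+_z\eta)}.
\]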



Since $F\ge0$ by Jensen's inequality and $g$ is bounded above, ergodicity gives
the limits for $\mu_0$-a.e. $\omega$:

\[
\varliminf_{n\to\infty}n^{-1}\log E_0\bigl[e^{nR^\ell_n(g)}1\{R^\ell_n\in O\}\bigr]\ \ge\ E^{\mu}[g]-H(\mu\times q\mid\mu\times p_\ell).
\]

(For the details of $Q_\eta\{L_n\in O\}\to1$ see the proof of Lemma 4.2 of [32].) By
$\mu_0\sim\mathbb P$ this also holds P-a.s. □

We are ready for the proof of the theorem.

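Proof of Theorem 2.3. Assume first that $g\in\mathcal L$ is bounded above. Then
Lemmas 2.11, 2.12 and 2.15 give these P-a.s. inequalities:

\[
\overline\Lambda_\ell(g,\omega)\le K_\ell(g)\le H^{\#}_{\ell,\mathbb P}(g)\le\underline\Lambda_\ell(g,\omega).
\]

Existence of the limit $\Lambda_\ell(g)$ and $\Lambda_\ell(g)=K_\ell(g)=H^{\#}_{\ell,\mathbb P}(g)$ follow.

Next, consider $g\in\mathcal L$. Lemma 2.14 implies that $K_\ell(g)=\sup_c K_\ell(\min(g,c))$.
Existence of the limit $\Lambda_\ell(\min(g,c))$ combined with Lemma 2.11 implies

\[
\begin{aligned}
K_\ell(g)&=\sup_c K_\ell(\min(g,c))=\sup_c\Lambda_\ell(\min(g,c))=\sup_c\underline\Lambda_\ell(\min(g,c))\\
&\le\underline\Lambda_\ell(g)\le\overline\Lambda_\ell(g)\le K_\ell(g).
\end{aligned}
\]

Existence of the limit and the equality $\Lambda_\ell(g)=K_\ell(g)$ follow again. For
the other variational formula write

\[
K_\ell(g)=\sup_c K_\ell(\min(g,c))=\sup_c\sup_{\mu}\bigl\{E^{\mu}[\min(g,c)]-H_{\ell,\mathbb P}(\mu)\bigr\}=H^{\#}_{\ell,\mathbb P}(g). \qquad\square
\]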

3 Large deviations under quenched polymer measures

As before, we continue to assume that $\mathcal R$ is finite and $(\Omega,\mathfrak S,\mathbb P,
\{T_z: z\in\mathcal G\})$ is a measurable ergodic system where $\mathcal G$ is the additive
subgroup of $\mathbb Z^d$ generated by $\mathcal R$. Now assume additionally that $\Omega$ is a
separable metric space and $\mathfrak S$ is its Borel $\sigma$-algebra.

Since our limiting logarithmic moment generating functions $\Lambda_\ell(g)$ are
defined only P-a.s. we need a separable function space that generates the weak
topology of probability measures. Give $\Omega_\ell$ a totally bounded metric and let
$\mathcal U_b(\Omega_\ell)$ be the space of uniformly continuous functions under this metric.
These functions are bounded. The space $\mathcal U_b(\Omega_\ell)$ is separable under the
supremum norm and generates the same topology on $\mathcal M_1(\Omega_\ell)$ as does the
space of bounded continuous functions.

Given a real-valued function $V$ on $\Omega_\ell$ define the quenched polymer measures

\[
Q^{V,\omega}_{n,0}(A)=\frac{1}{Z^{V,\omega}_{n,0}}\,E_0\Bigl[e^{-\sum_{k=0}^{n-1}V(T_{X_k}\omega,Z_{k+1,k+\ell})}\,1_A(\omega,X_{0,\infty})\Bigr]
\]

where $A$ is an event on environments and paths and

\[
Z^{V,\omega}_{n,0}=E_0\Bigl[e^{-\sum_{k=0}^{n-1}V(T_{X_k}\omega,Z_{k+1,k+\ell})}\Bigr].
\]

Theorem 2.3 gives the a.s. limit $\Lambda_\ell(-V)=\lim n^{-1}\log Z^{V,\omega}_{n,0}$. Next we
prove an LDP for the quenched distributions $Q^{V,\omega}_{n,0}\{R^\ell_n\in\cdot\,\}$ of the
empirical measure

\[
R^\ell_n=n^{-1}\sum_{k=0}^{n-1}\delta_{T_{X_k}\omega,\,Z_{k+1,k+\ell}}.
\]

Theorem 3.1. Fix $\ell\ge0$. Let $V$ be a measurable function on $\Omega_\ell$, $V\in\mathcal L$
and $\Lambda_\ell(-V)<\infty$. Then for P-a.e. $\omega$ the weak large deviation principle
holds for the sequence of probability distributions $Q^{V,\omega}_{n,0}\{R^\ell_n\in\cdot\,\}$ on
$\mathcal M_1(\Omega_\ell)$ with convex rate function

\[
I^V_{q,2,\ell}(\mu)=\sup_{g\in\mathcal U_b(\Omega_\ell)}\bigl\{E^{\mu}[g]-\Lambda_\ell(g-V)\bigr\}+\Lambda_\ell(-V). \tag{3.1}
\]

Rate $I^V_{q,2,\ell}$ is also equal to the lower semicontinuous regularization of

\[
H^V_{\ell,\mathbb P}(\mu)=\inf_{c<0}\bigl\{H_{\ell,\mathbb P}(\mu)+E^{\mu}[\max(V,c)]+\Lambda_\ell(-V)\bigr\}. \tag{3.2}
\]

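Proof of Theorem 3.1. We show an upper bound for compact sets $A$:

\[
\varlimsup_{n\to\infty}n^{-1}\log Q^{V,\omega}_{n,0}\{R^\ell_n\in A\}\le-\inf_{\mu\in A}I^V_{q,2,\ell}(\mu), \tag{3.3}
\]

a lower bound for open sets $G$:

\[
\varliminf_{n\to\infty}n^{-1}\log Q^{V,\omega}_{n,0}\{R^\ell_n\in G\}\ge-\inf_{\mu\in G}H^V_{\ell,\mathbb P}(\mu), \tag{3.4}
\]

and then match the rates.

By Theorem 2.3 and separability of $\mathcal U_b(\Omega_\ell)$ we have P-a.s. these finite
limits for all $g\in\mathcal U_b(\Omega_\ell)$:

\[
\lim_{n\to\infty}n^{-1}\log E^{Q^{V,\omega}_{n,0}}\bigl[e^{nR^\ell_n(g)}\bigr]=\Lambda_\ell(g-V)-\Lambda_\ell(-V).
\]

(3.3) follows by a general convex duality argument (see Theorem 4.5.3 in [11] or
Theorem 5.24 in [31]).

Lower bound (3.4) follows from Lemma 2.15 and a truncation: for $-\infty<c<0$,

\[
n^{-1}\log Q^{V,\omega}_{n,0}\{R^\ell_n\in O\}\ \ge\ n^{-1}\log E_0\bigl[e^{-nR^\ell_n(\max(V,c))}1\{R^\ell_n\in O\}\bigr]-n^{-1}\log E_0\bigl[e^{-nR^\ell_n(V)}\bigr].
\]

(3.4) continues to hold if $H^V_{\ell,\mathbb P}$ is replaced with its lower semicontinuous
regularization $H^{V,**}_{\ell,\mathbb P}(\mu)=\sup_B\inf_{\nu\in B}H^V_{\ell,\mathbb P}(\nu)$ where the supremum
is over open neighborhoods $B$ of $\mu$.

Theorem 2.3 implies that for $g\in\mathcal U_b(\Omega_\ell)$

\[
\begin{aligned}
\Lambda_\ell(g-V)-\Lambda_\ell(-V)&=\sup_{\mu\in\mathcal M_1(\Omega_\ell),\,c>0}\bigl\{E^{\mu}[\min(g-V,c)]-H_{\ell,\mathbb P}(\mu)-\Lambda_\ell(-V)\bigr\}\\
&=\sup_{\mu\in\mathcal M_1(\Omega_\ell),\,c<0}\bigl\{E^{\mu}[g-\max(V,c)]-H_{\ell,\mathbb P}(\mu)-\Lambda_\ell(-V)\bigr\}\\
&=\sup_{\mu\in\mathcal M_1(\Omega_\ell)}\bigl\{E^{\mu}[g]-H^V_{\ell,\mathbb P}(\mu)\bigr\}.
\end{aligned}
\]

Another convex duality gives $I^V_{q,2,\ell}(\mu)=H^{V,**}_{\ell,\mathbb P}(\mu)$ because the lower
semicontinuous regularization $H^{V,**}_{\ell,\mathbb P}$ is also equal to the double convex
dual of $H^V_{\ell,\mathbb P}$. □

Next we record the LDP for the quenched distributions of the empirical process

\[
R^\infty_n=n^{-1}\sum_{k=0}^{n-1}\delta_{T_{X_k}\omega,\,Z_{k+1,\infty}}.
\]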

Theorem 3.2. Let $V$ be a measurable function on some $\Omega_\ell$ with $V\in\mathcal L$ and
$\Lambda_\ell(-V)<\infty$. Then for P-a.e. $\omega$ the weak large deviation principle holds
for the sequence of probability distributions $Q^{V,\omega}_{n,0}\{R^\infty_n\in\cdot\,\}$ on
$\mathcal M_1(\Omega\times\mathcal R^{\mathbb N})$ with convex rate function
$I^V_{q,3}(\mu)=\sup_{\ell}I^V_{q,2,\ell}(\mu|_{\Omega_\ell})$.

Proof. This comes from a projective limit. Formula (3.1) shows that
$I^V_{q,2,\ell}(\mu\circ\pi_{\ell+1,\ell}^{-1})\le I^V_{q,2,\ell+1}(\mu)$ for $\mu\in\mathcal M_1(\Omega_{\ell+1})$ where
$\pi_{\ell+1,\ell}:\Omega_{\ell+1}\to\Omega_\ell$ is the natural projection. Since the weak topology
of $\mathcal M_1(\Omega\times\mathcal R^{\mathbb N})$ can be generated by uniformly continuous functions,
a base for the topology can be created from inverse images of open sets from
the spaces $\mathcal M_1(\Omega_\ell)$. Apply Theorem B.1. □

In one of the most basic situations, namely for strictly directed walks in
i.i.d. environments, we can upgrade the weak LDPs into full LDPs. This means
that the upper bound is valid for all closed sets. Strictly directed means that
there is a vector $u\in\mathbb R^d$ such that $z\cdot u>0$ for all $z\in\mathcal R$. Equivalently,
0 does not lie in the convex hull of $\mathcal R$.

Here is the setting. Let $\Gamma$ be a Polish space. Set $\Omega=\Gamma^{\mathbb Z^d}$ with generic
elements $\omega=(\omega_x)_{x\in\mathbb Z^d}$ and shift maps $(T_x\omega)_y=\omega_{x+y}$. Assume that the
coordinates $\{\omega_x\}$ are i.i.d. under $\mathbb P$.

Theorem 3.3. As described above, let $\mathbb P$ be an i.i.d. product measure on a
Polish product space $\Omega$. Assume that 0 does not lie in the convex hull of
$\mathcal R$. Let $V$ be a measurable function on some $\Omega_\ell$, $V\in\mathcal L$, and assume that
$\Lambda_\ell(-\beta V)<\infty$ for some $\beta>1$. Then for P-a.e. $\omega$ the full LDP holds for the
sequence of probability distributions $Q^{V,\omega}_{n,0}\{R^\infty_n\in\cdot\,\}$ on
$\mathcal M_1(\Omega\times\mathcal R^{\mathbb N})$ with the convex rate function $I^V_{q,3}$ described in
Theorem 3.2.

Proof. $\Lambda_\ell(-V)<\infty$ by Jensen's inequality. Due to Theorem 3.2 it suffices to
show that the distributions $Q^{V,\omega}_{n,0}\{R^\infty_n\in\cdot\,\}$ are exponentially tight
for P-a.e. $\omega$. Suppose we can show that

(3.5) distributions $P_0\{R^\infty_n\in\cdot\,\}$ are exponentially tight for P-a.e. $\omega$.

From the lower bound in (2.5) and the hypotheses on $V$ we have constants
$0<c_0,c_1<\infty$ such that, for P-a.e. $\omega$,

\[
E_0\bigl[e^{-nR^\ell_n(V)}\bigr]\ge e^{-c_0n}\quad\text{and}\quad E_0\bigl[e^{-nR^\ell_n(\beta V)}\bigr]\le e^{c_1n}
\]

for large enough $n$. Fix $\omega$ so that these bounds and (3.5) hold. Given
$c<\infty$, pick a compact $A\subset\mathcal M_1(\Omega\times\mathcal R^{\mathbb N})$ such that
$P_0\{R^\infty_n\in A^c\}\le e^{-\beta(c_0+c_1+c)n/(\beta-1)}$ for large $n$. Then, by Hölder's
inequality,

\[
Q^{V,\omega}_{n,0}\{R^\infty_n\in A^c\}\le E_0\bigl[e^{-nR^\ell_n(V)}\bigr]^{-1}E_0\bigl[e^{-nR^\ell_n(\beta V)}\bigr]^{1/\beta}P_0\{R^\infty_n\in A^c\}^{1-1/\beta}\le e^{-cn}.
\]

Thus it suffices to check (3.5). Next observe from

\[
\mathbb P\{\omega: P_0(R^\infty_n\in A^c)\ge e^{-cn}\}\le e^{cn}P(R^\infty_n\in A^c)
\]

and the Borel-Cantelli lemma that we only need exponential tightness under the
averaged measure $P=\mathbb P\otimes P_0$. As the last reduction, note that by the
compactness of $\mathcal R^{\mathbb N}$ it is enough to have the exponential tightness of the
$P$-distributions of $R^0_n=n^{-1}\sum_{k=0}^{n-1}\delta_{T_{X_k}\omega}$.
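The exponential tightness that is part of Sanov's theorem gives compact sets
$\{U_{m,x}: m\in\mathbb N,\ x\in\mathbb Z^d\}$ in the state space $\Gamma$ of the $\omega_x$ such that

\[
\mathbb P\Bigl\{n^{-1}\sum_{i=0}^{n-1}1_{U^c_{m,x}}(\omega_{y_i})>e^{-m-|x|}\Bigr\}\le e^{-n(m+|x|)}.
\]

Here $\{y_i\}$ are any distinct sites. Define

\[
H_m=\bigl\{Q\in\mathcal M_1(\Omega): \forall x\in\mathbb Z^d,\ Q\{\omega:\omega_x\notin U_{m,x}\}\le e^{-(m+|x|)}\bigr\}
\]

and compact sets

\[
K_b=\bigcap_{m\ge\ell(b)}H_m
\]

where $\ell=\ell(b)$ is chosen for $b\in\mathbb N$ so that

\[
\sum_{m\ge\ell-b}\ \sum_{x}e^{-(m+|x|)}\le1.
\]

Now

\[
P\{R^0_n\notin K_b\}\le\sum_{m\ge\ell(b)}P\{R^0_n\notin H_m\}\le\sum_{m\ge\ell(b)}\sum_{x}P\Bigl\{n^{-1}\sum_{k=0}^{n-1}1_{U^c_{m,x}}(\omega_{x+X_k})>e^{-m-|x|}\Bigr\}\le e^{-bn}.
\]

The crucial point used above was that under the assumption on $\mathcal R$ the points
$\{X_n\}$ of the walk are distinct (Corollary A.2), and so the variables
$\{\omega_{x+X_n}\}$ are i.i.d. under $P$. This gives the exponential tightness of the
$P$-distributions of $R^0_n$.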

Remark 3.4. For exponential tightness the theorem above is in some sense best
possible. Theorem 3.3 can fail if 0 lies in the convex hull of $\mathcal R$. Then a
loop is possible (Corollary A.2). Suppose the distribution of $\omega_0$ is not
supported on any compact set. Then, given any compact set $U$ in $\Gamma$, wait until
the walk finds an environment $\omega_x\notin U$, and then forever after execute a loop
at $x$.

4 Large deviations for random walk in random environment

This final section before the appendices is a remark about adapting the results
of Section 3 to RWRE described in Example 1.2. Continue with the assumptions
on $(\Omega,\mathfrak S,\mathbb P,\{T_z: z\in\mathcal G\})$ from Section 3. Fix any $\ell\ge1$ and let
$V(\omega,z_{1,\ell})=-\log\pi_{0,z_1}(\omega)$ to put RWRE in the polymer framework. Then
$\Lambda_\ell(-V)=-\log|\mathcal R|$. The necessary assumption is now

(4.1) $|\log\pi_{0,z}|\in\mathcal L$ for each $z\in\mathcal R$.

The commonly used RWRE assumption of uniform ellipticity, namely the existence
of $\kappa>0$ such that $\mathbb P\{\pi_{0,z}\ge\kappa\}=1$ for $z\in\mathcal R$, implies (4.1).

Under assumption (4.1) Theorems 3.1 and 3.2 are valid for RWRE and give
quenched weak LDPs for the distributions $P^\omega_0\{R^\ell_n\in\cdot\,\}$ and
$P^\omega_0\{R^\infty_n\in\cdot\,\}$. Note though that for $\ell\ge2$, $Q^{V,\omega}_{n,0}\{R^\ell_n\in B\}$ is not
exactly equal to $P^\omega_0\{R^\ell_n\in B\}$ because under $Q^{V,\omega}_{n,0}$ steps $Z_k$ for $k>n$
are taken from kernel $p$. This difference vanishes in the limit due to
$\log\pi_{0,z}(\omega)\in L^1(\mathbb P)$. These LDPs take care of cases of RWRE not covered by
[32], namely those walks for which 0 does not lie in the relative interior of
the convex hull of $\mathcal R$.

For RWRE the rate function $I^V_{q,2,\ell}$ in Theorem 3.1 can be expressed directly
as the lower semicontinuous regularization of an entropy. Indeed, let
$\tilde V(\omega,z_{1,\ell})=-\log\pi_{0,z_\ell}(T_{x_{\ell-1}}\omega)$. The difference between using
potential $V$ and potential $\tilde V$ is only in finitely many terms in the
exponent. Thus $\Lambda_\ell(g-V)=\Lambda_\ell(g-\tilde V)$ for all $g\in\mathcal U_b(\Omega_\ell)$. Then (3.1)
shows that $I^V_{q,2,\ell}=I^{\tilde V}_{q,2,\ell}$. The latter rate is the lower semicontinuous
regularization of $H^{\tilde V}_{\ell,\mathbb P}$ in (3.2), which itself equals $H_{\ell,\mathbb P}$ from (2.3)
with $p_\ell$ replaced with the kernel $p^+(\eta,S^+_z\eta)=\pi_{0,z}(T_{x_\ell}\omega)$ of the Markov
chain $(T_{X_k}\omega,Z_{k+1,k+\ell})$ under $P^\omega_0$. By Lemma 6.1 of [32] the same is true of
the level 3 rate $I^V_{q,3}$ under the additional assumption that $\Omega$ is a compact
space. We refer to [32] for this and some other properties of $I^V_{q,3}$.

If $\Omega$ is compact, these weak LDPs are of course full LDPs, that is, the upper
bound holds for all closed sets. For RWRE with finite $\mathcal R$ the natural canonical
choice of $\Omega$ is compact: in the setting of Example 1.2 take $\Omega=\mathcal P^{\mathbb Z^d}$ with
generic elements $\omega=(\omega_x)_{x\in\mathbb Z^d}$ and $p(\omega)=\omega_0$ the projection at the origin.

If $\Omega$ is compact we can project the LDP of Theorem 3.1 to the level of the
walk to obtain the following statements. The limiting logarithmic moment
generating function

\[
\Lambda(t)=\lim_{n\to\infty}n^{-1}\log E^\omega_0\bigl[e^{t\cdot X_n}\bigr],\qquad t\in\mathbb R^d, \tag{4.2}
\]

exists a.s. Its convex conjugate

\[
I(\zeta)=\sup_{t\in\mathbb R^d}\{\zeta\cdot t-\Lambda(t)\},\qquad\zeta\in\mathbb R^d,
\]

is the rate function for the LDP of the distributions $P^\omega_0\{n^{-1}X_n\in\cdot\,\}$ on
$\mathbb R^d$. For walks without ellipticity, in particular for walks with $0\notin\mathcal U$,
even this quenched position-level LDP is new. It has been proved in the past
only in a neighborhood of the limiting velocity [48].
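To illustrate how a convex conjugate of the type above is evaluated, here is a minimal numerical sketch. It is an assumption-laden toy case and not the RWRE setting of the paper: the environment is deterministic and the walk is the one-dimensional simple symmetric random walk, for which $\Lambda(t)=\log\cosh t$ and the conjugate has a known closed form. All function names are hypothetical.

```python
import math

def log_cosh(t):
    """Numerically stable log(cosh(t)); this is Lambda(t) for the simple
    symmetric walk on Z (toy assumption: no random environment)."""
    a = abs(t)
    return a + math.log1p(math.exp(-2.0 * a)) - math.log(2.0)

def legendre_transform(zeta, lam, lo=-20.0, hi=20.0, iters=200):
    """Approximate sup_t {zeta*t - lam(t)} by ternary search.
    Valid because t -> zeta*t - lam(t) is concave when lam is convex."""
    def objective(t):
        return zeta * t - lam(t)
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if objective(m1) < objective(m2):
            lo = m1
        else:
            hi = m2
    return objective(0.5 * (lo + hi))

def srw_rate(zeta):
    """Closed-form conjugate of log cosh for |zeta| < 1 (Cramer rate)."""
    return (0.5 * (1 + zeta) * math.log(1 + zeta)
            + 0.5 * (1 - zeta) * math.log(1 - zeta))

numeric_half = legendre_transform(0.5, log_cosh)
exact_half = srw_rate(0.5)
numeric_zero = legendre_transform(0.0, log_cosh)
```

The numerical supremum agrees with the closed form, and the rate vanishes at the mean velocity 0, as a rate function obtained by conjugation should.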

In the following appendices we invoke the ergodic theorem a few times. By that 
we mean the multidimensional ergodic theorem; see for example Theorem 14.A8 
in [20]. 

Appendix A: Some auxiliary lemmas 

In this appendix $\mathcal R$ is a finite subset of $\mathbb Z^d$, $\mathcal G$ the additive subgroup
of $\mathbb Z^d$ generated by $\mathcal R$, and $\mathcal U$ the convex hull of $\mathcal R$ in $\mathbb R^d$.

Lemma A.1. Let $\xi\in\mathbb Q^d\cap\mathcal U$. Then there exist rational coefficients
$\alpha_z\ge0$ such that $\sum_{z\in\mathcal R}\alpha_z=1$ and $\xi=\sum_{z\in\mathcal R}\alpha_zz$.

Proof. Suppose first that $\mathcal R=\{z_0,\dots,z_n\}$ for affinely independent points
$z_0,\dots,z_n$. This means that the vectors $z_1-z_0,\dots,z_n-z_0$ are linearly
independent in $\mathbb R^d$, and then necessarily $n\le d$. Augment this set to a basis
$\{b_1=z_1-z_0,\dots,b_n=z_n-z_0,b_{n+1},\dots,b_d\}$ of $\mathbb R^d$ where $b_{n+1},\dots,b_d$ are
also integer vectors (for example, by including a suitable set of $d-n$ standard
basis vectors). Let $A$ be the unique invertible linear transformation such that
$Ab_i=e_i$ for $1\le i\le d$. In the standard basis the matrix of $A$ is the
inverse of the matrix $B=[b_1,\dots,b_d]$, hence this matrix has rational entries.

Now let $\xi=\sum_{i=0}^n\alpha_iz_i$ be a representation of $\xi$ as a convex combination
of $z_0,\dots,z_n$. Then $\xi-z_0=\sum_{i=1}^n\alpha_i(z_i-z_0)$, and after an application
of $A$, $A\xi-Az_0=\sum_{i=1}^n\alpha_ie_i$. The vector on the left has rational
coordinates by the assumptions and by what was just said about $A$. The vector
on the right is $[\alpha_1,\dots,\alpha_n,0,\dots,0]^T$. Hence the coefficients
$\alpha_1,\dots,\alpha_n$ are rational, and so is also $\alpha_0=1-\sum_{i=1}^n\alpha_i$.

Now consider the case of a general $\mathcal R$. By Carathéodory's theorem, every
point in the convex hull of $\mathcal R$ is a convex combination of $d+1$ or fewer
affinely independent points of $\mathcal R$ [34, Corollary 17.1.1]. Thus the argument
given above covers the general case. □

The next simple corollary characterizes the existence of a loop.

Corollary A.2. The existence of a loop (i.e. $z_{1,m}\in\mathcal R^m$ with
$z_1+\cdots+z_m=0$) is equivalent to $0\in\mathcal U$.
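One direction of Corollary A.2 can be made concrete: rational convex weights that represent 0 (which Lemma A.1 provides whenever 0 lies in the convex hull) turn into a loop once denominators are cleared. The sketch below is illustrative only; the step sets and weights are made-up examples, and `loop_from_weights` is a hypothetical helper name.

```python
from fractions import Fraction
from math import lcm

def loop_from_weights(steps, weights):
    """Given steps z and rational weights a_z >= 0 with sum a_z = 1 and
    sum a_z * z = 0, return a list of steps (with repetition) summing to 0."""
    weights = [Fraction(w) for w in weights]
    assert all(w >= 0 for w in weights) and sum(weights) == 1
    dim = len(steps[0])
    # Check that the weighted combination is exactly the origin.
    assert all(sum(w * z[i] for w, z in zip(weights, steps)) == 0
               for i in range(dim))
    # Clearing denominators: m * a_z is a nonnegative integer multiplicity.
    m = lcm(*(w.denominator for w in weights))
    loop = []
    for z, w in zip(steps, weights):
        loop.extend([z] * int(w * m))
    return loop

# Illustrative step sets (assumptions, not taken from the paper).
loop_a = loop_from_weights([(1, 0), (0, 1), (-1, -1)],
                           [Fraction(1, 3)] * 3)
loop_b = loop_from_weights([(2, 0), (-1, 0)],
                           [Fraction(1, 3), Fraction(2, 3)])
endpoint_a = tuple(sum(z[i] for z in loop_a) for i in range(2))
endpoint_b = tuple(sum(z[i] for z in loop_b) for i in range(2))
```

Both constructed step sequences return to the origin. Exact rational arithmetic is used so that the check of the hypothesis is not affected by rounding.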

This corollary expresses the irreducibility assumption used in [32] in terms of
the convex hull of $\mathcal R$.

Corollary A.3. There is a path from 0 to each $y\in\mathcal G$ with steps from $\mathcal R$ if
and only if 0 is in the relative interior of $\mathcal U$.

Proof. Each $y\in\mathcal G$ is reachable from 0 if and only if $-x$ is reachable from 0
for each $x\in\mathcal R$. This is equivalent to the existence of an identity
$0=x_1+\cdots+x_m$ where each $x_i$ is in $\mathcal R$ and each $z\in\mathcal R$ appears at least
once among the $x_i$'s. Equivalently, we can write 0 as a convex combination of
$\mathcal R$ so that each $z\in\mathcal R$ has a positive rational coefficient. Using Lemma A.1,
this in turn is equivalent to the following statement: for each $z\in\mathcal R$,
$-\varepsilon z\in\mathcal U$ for small enough $\varepsilon>0$. By Theorem 6.4 in [34] this is the same
as $0\in\operatorname{ri}\mathcal U$. □

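This lemma gives sufficient conditions for membership in class $\mathcal L$ of
Definition 2.1.

Lemma A.4. Let $(\Omega,\mathfrak S,\mathbb P,\{T_x: x\in\mathcal G\})$ be a measurable ergodic dynamical
system. Let $0\le g\in L^1(\mathbb P)$. Assume one of the conditions (a)-(d) below.

(a) $g$ is bounded.

(b) $d=1$.

(c) $d\ge2$. There exist $r\in(0,\infty)$ and $p>d$ such that $\mathbb E[g^p]<\infty$ and
$\{g\circ T_{x_i}: i=1,\dots,m\}$ are i.i.d. whenever $|x_i-x_j|>r$ for all $i\ne j$.

(d) $d\ge2$. There exist $a>d$ and $p>ad/(a-d)$ such that $\mathbb E[g^p]<\infty$ and for
each $z\in\mathcal R\setminus\{0\}$ and large $k\in\mathbb N$

\[
\sup_{\substack{A\in\sigma(g\circ T_x:\ x\cdot z<0)\\ B\in\sigma(g\circ T_x:\ x\cdot z>k)}}|\mathbb P(A\cap B)-\mathbb P(A)\mathbb P(B)|\le k^{-a}. \tag{A.1}
\]

Then, for each $z\in\mathcal R\setminus\{0\}$

\[
\lim_{\varepsilon\searrow0}\ \varlimsup_{n\to\infty}\ \max_{x\in\mathcal G:\,|x|\le n}\ \frac1n\sum_{0\le i\le n\varepsilon}g\circ T_{x+iz}=0\quad\mathbb P\text{-a.s.} \tag{A.2}
\]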

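Proof. Part (a) is immediate.

For (b) let $s\in\mathbb N$ be such that $\mathcal G=\{ns: n\in\mathbb Z\}$. Fix $z=as$ and let
$\bar g=g-\mathbb E(g\mid\mathcal I_z)$ where $\mathcal I_z$ is the $\sigma$-algebra of events invariant
under $T_z$. By $T_z$-invariance

\[
\max_{-n\le j\le n}\ \frac1n\sum_{i=0}^{\varepsilon n}\mathbb E(g\mid\mathcal I_z)\circ T_{js+ias}\le\Bigl(\varepsilon+\frac1n\Bigr)\max_{0\le j<a}\mathbb E(g\mid\mathcal I_z)\circ T_{js},\quad\mathbb P\text{-a.s.}
\]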

By the ergodic theorem 



lim max 

\t\<n 



1 " 

1-1 



P-a.s. 



This limit is not changed by taking a finite maximum over the shifts by $T_{js}$,
$0\le j<a$.

Part (c) follows from part (d).

Fix $z$ for part (d). First two reductions. (i) The maximum over $x$ in (A.2)
can be restricted to a set $A_n$ of size $|A_n|\le Cn^{d-1}\varepsilon^{-1}$, at the expense of
doubling $\varepsilon$ in the upper summation limit. The reason is that $g\ge0$ and if
$x'=x+jz$ for some $1\le j\le n\varepsilon/2$, then the $2n\varepsilon$-sum started at $x$ covers
the $n\varepsilon$-sum started at $x'$.

(ii) It suffices to consider a subsequence $n_m=m^\gamma$ for any fixed $\gamma>0$
because $n_{m+1}/n_m\to1$ and $g\ge0$.

Since constants satisfy (A.2) we can replace $g$ with $\bar g=g-\mathbb E[g]$. Let
$S^x_n=\sum_{i=0}^{n}\bar g\circ T_{x+iz}$. Equation (A.1) and the translation invariance of
$\mathbb P$ imply strong mixing as defined by [33]. Then applying Theorem 6 therein
with $u=n^{-b}$, $r$ large enough, and $t=\delta n/(cr)$ we get a generalization of the
Fuk-Nagaev inequality to square-integrable, mean-zero strongly mixing random
variables. This implies that for fixed $\varepsilon,\delta>0$, $\mathbb P\{|S^x_{n\varepsilon}|\ge n\delta\}\le
C(\varepsilon,\delta)n^{1-b}$ with $b=ap/(a+p)>d$. By a straightforward union bound

\[
\mathbb P\Bigl\{\max_{x\in A_n}\sum_{k=0}^{n\varepsilon}\bar g\circ T_{x+kz}\ge n\delta\Bigr\}\le Cn^{d-1}\varepsilon^{-1}\,\mathbb P\{|S^0_{n\varepsilon}|\ge n\delta\}\le C(\varepsilon,\delta)n^{d-b}.
\]

Along the subsequence $n_m=m^\gamma$ for $\gamma>(b-d)^{-1}$ the last bound is summable.
We get P-a.s. convergence to 0 for each fixed $\varepsilon>0$ by the Borel-Cantelli
lemma. □



For a general ergodic system (a) cannot be improved. For example, take $d=2$,
an i.i.d. sequence $\{\omega_{i,0}\}_{i\in\mathbb Z}$, and then set $\omega_{i,j}=\omega_{i,0}$. For $z=e_2$,
$n^{-1}\sum_{j=0}^{n\varepsilon}|\omega_{x+(0,j)}|\ge\varepsilon|\omega_x|$ and consequently the limit in $n$ in
(A.2) blows up unless $\omega_{i,j}$ is a bounded process.

If the mixing in part (d) above is faster than any polynomial, then we can take
$a\to\infty$ and the condition becomes $p>d$. Part (c) is close to optimal. If
$\mathbb E[g^d]=\infty$ then $n^{-1}\max_{|x|\le n}g\circ T_x$ blows up by the second
Borel-Cantelli lemma. Currently
we do not know if p > d is sufficient in (c). 



Appendix B: Weak LDP through a projective limit 

We describe a small alteration of the projective limit LDP. Let $\mathcal X$ and
$\mathcal X_j$, $j\in\mathbb N$, be metric spaces with continuous maps $g_j:\mathcal X\to\mathcal X_j$ and
$g_{j,i}:\mathcal X_j\to\mathcal X_i$ for $i<j$ such that $g_i=g_{j,i}\circ g_j$ and
$g_{k,i}=g_{j,i}\circ g_{k,j}$. Let $\{\mu_n\}$ be a sequence of Borel probability measures
on $\mathcal X$, and define $\mu^j_n=\mu_n\circ g_j^{-1}$ on $\mathcal X_j$. Let $I_j:\mathcal X_j\to[0,\infty]$ be
lower semicontinuous. Define $I(x)=\sup_jI_j(g_j(x))$ for $x\in\mathcal X$.

Theorem B.1. (i) Suppose that for all $j$, $I_j\circ g_{j+1,j}\le I_{j+1}$ and $I_j$
satisfies the large deviation upper bound for compact sets in $\mathcal X_j$. Then $I$
satisfies the large deviation upper bound for compact sets in $\mathcal X$.

(ii) Assume that the collection $\{g_j^{-1}(U_j): j\in\mathbb N,\ U_j\subset\mathcal X_j\text{ open}\}$ is a
base for the topology of $\mathcal X$. Suppose that for all $j$, $I_j$ satisfies the
large deviation lower bound for open sets in $\mathcal X_j$. Then $I$ satisfies the
large deviation lower bound for open sets in $\mathcal X$.

Proof. Part (ii) is straightforward. We prove part (i). Let $A\subset\mathcal X$ be
compact. Since $g_j(A)$ is compact in $\mathcal X_j$ and $g_j^{-1}(g_j(A))\supset A$,

\[
\varlimsup_{n\to\infty}n^{-1}\log\mu_n(A)\le\varlimsup_{n\to\infty}n^{-1}\log\mu^j_n(g_j(A))\le-\inf_{y\in g_j(A)}I_j(y)=-\inf_{x\in A}I_j(g_j(x))
\]

from which

\[
\varlimsup_{n\to\infty}n^{-1}\log\mu_n(A)\le-\sup_j\inf_{x\in A}I_j(g_j(x)).
\]

Next we claim a minimax property from the assumption of monotonicity:

\[
\sup_j\inf_{x\in A}I_j(g_j(x))=\inf_{x\in A}\sup_jI_j(g_j(x))=\inf_{x\in A}I(x). \tag{B.1}
\]

Inequality $\le$ is obviously true. To show $\ge$, let $c<\inf_{x\in A}\sup_jI_j(g_j(x))$.
Then each $x\in A$ has an index $j(x)$ such that $I_{j(x)}(g_{j(x)}(x))>c$. The set
$D_x=\{z\in\mathcal X: I_{j(x)}(g_{j(x)}(z))>c\}$ is open by the continuity of $g_j$ and
lower semicontinuity of $I_j$. Cover $A$ with finitely many: $A\subset D_{x_1}\cup\cdots\cup
D_{x_k}$. Fix $j\ge j(x_1)\vee\cdots\vee j(x_k)$. Then if $x\in A$ pick $\ell$ such that
$x\in D_{x_\ell}$, and we have

\[
I_j(g_j(x))\ge I_{j(x_\ell)}\bigl(g_{j,j(x_\ell)}(g_j(x))\bigr)=I_{j(x_\ell)}(g_{j(x_\ell)}(x))>c.
\]

Thus $\inf_{x\in A}I_j(g_j(x))\ge c$. We have proved (B.1) and thereby the upper
large deviation bound for $A$. □
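The role of monotonicity in (B.1) can be seen on a finite toy example. In the sketch below, which is illustrative only, `table[j][x]` stands for the value $I_j(g_j(x))$ on a finite set $A$; the numbers are invented and the function names are hypothetical.

```python
def sup_inf(table):
    """sup over j of inf over x, where table[j][x] plays the role of I_j(g_j(x))."""
    return max(min(row) for row in table)

def inf_sup(table):
    """inf over x of sup over j for the same table."""
    return min(max(col) for col in zip(*table))

def is_monotone_in_j(table):
    """True if each column is nondecreasing in j (the monotonicity assumption)."""
    return all(a <= b for col in zip(*table) for a, b in zip(col, col[1:]))

# Monotone family (made-up values): every column is nondecreasing in j.
monotone = [[0, 1, 0],
            [1, 1, 2],
            [1, 3, 2]]

# Non-monotone family: the interchange of sup and inf fails.
non_monotone = [[0, 1],
                [1, 0]]

monotone_pair = (sup_inf(monotone), inf_sup(monotone))
non_monotone_pair = (sup_inf(non_monotone), inf_sup(non_monotone))
```

For the monotone table both sides equal 1, while for the non-monotone table the left side is 0 and the right side is 1. On a finite set the monotone case is immediate because the last index dominates; the compactness argument in the proof is what replaces this in general.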



Appendix C: Proofs of Lemmas 2.9 and 2.13 

Standing assumptions in this section are the same as in Section 2:
$(\Omega,\mathfrak S,\mathbb P,\{T_z: z\in\mathcal G\})$ is a measurable ergodic dynamical system and,
as throughout the paper, $\mathcal R$ is an arbitrary finite subset of $\mathbb Z^d$ that
generates the additive group $\mathcal G$. Throughout this section $\ell\ge0$ is a fixed
integer. $C$ denotes a chameleon constant that can change from term to term and
only depends on $\mathcal R$, $\ell$, and $d$. In order to avoid working on a sublattice,
we will assume throughout this appendix that $\mathcal R$ generates $\mathbb Z^d$ as a group.
This does not cause any loss of generality. The additive group $\mathcal G$ generated
by $\mathcal R$ is linearly isomorphic to $\mathbb Z^{d'}$ for some $d'\le d$ [40, p. 65-66] and we
can transport the model to $\mathbb Z^{d'}$.

A crucial tool will be the path integral of a function $F\in\mathcal K_\ell$. The main idea
is that due to the closed loop property these functions are gradient-like.

For $\ell$-tuples $z_{1,\ell},\tilde z_{1,\ell}\in\mathcal R^\ell$ we write $x_\ell=z_1+\cdots+z_\ell$ and
$\tilde x_\ell=\tilde z_1+\cdots+\tilde z_\ell$. We say that there exists a path from $(y,\tilde z_{1,\ell})$ to
$(x,z_{1,\ell})$ when there exist $a_1,\dots,a_m\in\mathcal R$ such that the composition
$S^+_{a_m}\circ\cdots\circ S^+_{a_1}$ takes $(T_y\omega,\tilde z_{1,\ell})$ to $(T_x\omega,z_{1,\ell})$ for all $\omega\in\Omega$.
This is equivalent to the pair of equations

\[
y+\tilde x_\ell+a_1+\cdots+a_{m-\ell}=x\quad\text{and}\quad a_{m-\ell+1,m}=z_{1,\ell}.
\]

For any two points $(x,z_{1,\ell})$ and $(\bar x,\bar z_{1,\ell})$ and any $\tilde z_{1,\ell}$ there exists a
point $y\in\mathbb Z^d$ such that from $(y,\tilde z_{1,\ell})$ there is a path to both $(x,z_{1,\ell})$
and $(\bar x,\bar z_{1,\ell})$. For this, find first $a_1,\dots,a_{m-\ell}$ and
$\bar a_1,\dots,\bar a_{n-\ell}\in\mathcal R$ such that

\[
x-\bar x=(a_1+\cdots+a_{m-\ell})-(\bar a_1+\cdots+\bar a_{n-\ell})
\]

so that

\[
y'=x-(a_1+\cdots+a_{m-\ell})=\bar x-(\bar a_1+\cdots+\bar a_{n-\ell})
\]

and then take $y=y'-\tilde x_\ell$. By induction, for any finite number of points there
is a common starting point from which there exists a path to each of the chosen
points.

Now fix a measurable function $F:\Omega_\ell\times\mathcal R\to\mathbb R$ that satisfies the closed
loop property (iii) of Definition 2.2. If there is a path $(a_i)_{i=1}^m$ from
$(y,\tilde z_{1,\ell})$ to $(x,z_{1,\ell})$, set $\eta_0=(T_y\omega,\tilde z_{1,\ell})$, $\eta_i=S^+_{a_i}\eta_{i-1}$ for
$i=1,\dots,m$ so that $\eta_m=(T_x\omega,z_{1,\ell})$, and then

\[
L(\omega,(y,\tilde z_{1,\ell}),(x,z_{1,\ell}))=\sum_{i=0}^{m-1}F(\eta_i,a_{i+1}). \tag{C.1}
\]

By the closed loop property $L(\omega,(y,\tilde z_{1,\ell}),(x,z_{1,\ell}))$ is independent of the
path chosen. We also admit an empty path that gives

\[
L(\omega,(x,z_{1,\ell}),(x,z_{1,\ell}))=0.
\]
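The path independence of a sum of the form (C.1) is easiest to see for an exact gradient. The sketch below is a simplified illustration under stated assumptions, not the setting of the appendix: it takes $\ell=0$, replaces the environment by a fixed integer-valued function `h` on $\mathbb Z^2$ (an invented example), and uses $F(x,a)=h(x+a)-h(x)$, which satisfies the closed loop property by telescoping.

```python
def h(x):
    """Hypothetical potential on Z^2 (illustrative choice only)."""
    return x[0] ** 2 - 3 * x[0] * x[1] + 5 * x[1]

def F(x, a):
    """Gradient-type increment F(x, a) = h(x + a) - h(x)."""
    return h((x[0] + a[0], x[1] + a[1])) - h(x)

def path_integral(start, steps):
    """Sum F along the path that starts at `start` and takes the given steps.
    Returns the accumulated sum and the endpoint of the path."""
    position, total = start, 0
    for a in steps:
        total += F(position, a)
        position = (position[0] + a[0], position[1] + a[1])
    return total, position

# Two different paths from (0, 0) to (2, 1); the second one backtracks.
direct = [(1, 0), (1, 0), (0, 1)]
detour = [(0, 1), (1, 0), (-1, 0), (1, 0), (1, 0)]

direct_sum, direct_end = path_integral((0, 0), direct)
detour_sum, detour_end = path_integral((0, 0), detour)
empty_sum, _ = path_integral((0, 0), [])
```

Both paths end at the same point and give the same sum, namely `h((2, 1)) - h((0, 0))`, and the empty path gives 0. For a general $F$ with the closed loop property there need not be a bounded `h`, which is why the appendix works with the path integral directly.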

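If $a_1,\dots,a_m$ work for $(y,\tilde z_{1,\ell})$ and $(x,z_{1,\ell})$, then these steps work also
for $(y+u,\tilde z_{1,\ell})$ and $(x+u,z_{1,\ell})$. The effect on the right-hand side of (C.1)
is to shift $\omega$ by $u$, and consequently

\[
L(T_u\omega,(y,\tilde z_{1,\ell}),(x,z_{1,\ell}))=L(\omega,(y+u,\tilde z_{1,\ell}),(x+u,z_{1,\ell})). \tag{C.2}
\]

Next define $f:\Omega\times\mathcal R^\ell\times\mathcal R^\ell\times\mathbb Z^d\to\mathbb R$ by

\[
f(\omega,z_{1,\ell},\bar z_{1,\ell},x)=L(\omega,(y,\tilde z_{1,\ell}),(x,\bar z_{1,\ell}))-L(\omega,(y,\tilde z_{1,\ell}),(0,z_{1,\ell})) \tag{C.3}
\]

for any $(y,\tilde z_{1,\ell})$ with a path to both $(0,z_{1,\ell})$ and $(x,\bar z_{1,\ell})$. This
definition is independent of the choice of $(y,\tilde z_{1,\ell})$, again by the closed
loop property.
Here are some basic properties of $f$.

Lemma C.1. Let $F(\cdot,z_{1,\ell},z)\in L^1(\mathbb P)$ for each $(z_{1,\ell},z)$ and satisfy the
closed loop property (iii) of Definition 2.2.

(a) There exists a constant $C$ depending only on $d$, $\ell$, and
$R=\max\{|z|: z\in\mathcal R\}$ such that for all $z_{1,\ell},\bar z_{1,\ell}\in\mathcal R^\ell$, $x\in\mathbb Z^d$, and
P-a.e. $\omega$

\[
|f(\omega,z_{1,\ell},\bar z_{1,\ell},x)|\le\sum_{b:\,|b|\le C(|x|+1)}\ \max_{\tilde z_{1,\ell}\in\mathcal R^\ell}\ \max_{z\in\mathcal R}|F(T_b\omega,\tilde z_{1,\ell},z)|.
\]

In particular, $f\in L^1(\mathbb P)$ for all $(z_{1,\ell},\bar z_{1,\ell},x)$.

(b) For $z_{1,\ell},\bar z_{1,\ell},\tilde z_{1,\ell}\in\mathcal R^\ell$, $x,\tilde x\in\mathbb Z^d$, and P-a.e. $\omega$,

\[
f(\omega,z_{1,\ell},\bar z_{1,\ell},x)=f(\omega,z_{1,\ell},\tilde z_{1,\ell},\tilde x)+f(T_{\tilde x}\omega,\tilde z_{1,\ell},\bar z_{1,\ell},x-\tilde x).
\]

(c) Assume additionally that $F$ satisfies the mean zero property (ii) of
Definition 2.2. Then for any $z_{1,\ell}\in\mathcal R^\ell$ and $x\in\mathbb Z^d$,
$\mathbb E[f(\omega,z_{1,\ell},z_{1,\ell},x)]=0$.

Proof. Let $\{e_i\}_{i=1}^d$ be the canonical basis of $\mathbb R^d$. For each $1\le i\le d$,
there exist nonnegative integers $n_i^{\pm}$ and $(a^{\pm}_{i,j})_{j=1}^{n_i^{\pm}}$ from $\mathcal R$ such that

\[
e_i=a^+_{i,1}+\cdots+a^+_{i,n_i^+}-a^-_{i,1}-\cdots-a^-_{i,n_i^-}.
\]

Write $x=\sum_{i=1}^db_is_ie_i$ with $b_i\ge0$ and $s_i\in\{-1,+1\}$. Then,

\[
x=\sum_{i=1}^d\sum_{j=1}^{n_i^{s_i}}b_i\,a^{s_i}_{i,j}-\sum_{i=1}^d\sum_{j=1}^{n_i^{-s_i}}b_i\,a^{-s_i}_{i,j}.
\]

One can thus find a $y$ that has paths to both 0 and $x$ that stay inside a
ball of radius $C(|x|+1)$. This proves (a).

To prove (b), let $(y,\hat z_{1,\ell})$ have paths to $(-\tilde x,z_{1,\ell})$, $(0,\tilde z_{1,\ell})$, and
$(x-\tilde x,\bar z_{1,\ell})$. Use the definition (C.3) of $f$ and the shift property (C.2)
to write

\[
\begin{aligned}
f(T_{\tilde x}\omega,\tilde z_{1,\ell},\bar z_{1,\ell},x-\tilde x)
&=L(T_{\tilde x}\omega,(y,\hat z_{1,\ell}),(x-\tilde x,\bar z_{1,\ell}))-L(T_{\tilde x}\omega,(y,\hat z_{1,\ell}),(0,\tilde z_{1,\ell}))\\
&=L(\omega,(y+\tilde x,\hat z_{1,\ell}),(x,\bar z_{1,\ell}))-L(\omega,(y+\tilde x,\hat z_{1,\ell}),(\tilde x,\tilde z_{1,\ell}))\\
&=\bigl[L(\omega,(y+\tilde x,\hat z_{1,\ell}),(x,\bar z_{1,\ell}))-L(\omega,(y+\tilde x,\hat z_{1,\ell}),(0,z_{1,\ell}))\bigr]\\
&\qquad-\bigl[L(\omega,(y+\tilde x,\hat z_{1,\ell}),(\tilde x,\tilde z_{1,\ell}))-L(\omega,(y+\tilde x,\hat z_{1,\ell}),(0,z_{1,\ell}))\bigr]\\
&=f(\omega,z_{1,\ell},\bar z_{1,\ell},x)-f(\omega,z_{1,\ell},\tilde z_{1,\ell},\tilde x).
\end{aligned}
\]

For (c), let $y$ be so that from $(y,z_{1,\ell})$ there is a path to both $(x,z_{1,\ell})$
and $(0,z_{1,\ell})$. Then

\[
f(\omega,z_{1,\ell},z_{1,\ell},x)=L(\omega,(y,z_{1,\ell}),(x,z_{1,\ell}))-L(\omega,(y,z_{1,\ell}),(0,z_{1,\ell})).
\]

Both $L$-terms above equal sums $\sum_{i=0}^{m-1}F(\eta_i,a_{i+1})$ where $\eta_0=(T_y\omega,z_{1,\ell})$
and $\eta_m=(T_u\omega,z_{1,\ell})$ with $u=x$ or $u=0$. Both have zero $\mathbb E$-mean by
property (ii) of Definition 2.2. □

Remark C.2. Part (b) above shows that $f$ is a path integral of $F$ or,
alternatively, that $F$ is a gradient of $f$. More precisely,

\[
F(\omega,z_{1,\ell},z)=f(\omega,\bar z_{1,\ell},S^+_zz_{1,\ell},z_1)-f(\omega,\bar z_{1,\ell},z_{1,\ell},0),
\]

for all $z_{1,\ell},\bar z_{1,\ell}\in\mathcal R^\ell$, and P-a.e. $\omega$. ($S^+_z$ acts on $\mathcal R^\ell$ in the
obvious way.)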

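Lemma C.3. Let $F\in\mathcal K_\ell$. Then, there exists a sequence of bounded measurable
functions $h_k:\Omega_\ell\to\mathbb R$ such that $\mathbb E[|h_k(S^+_z\eta)-h_k(\eta)-F(\eta,z)|]\to0$ for
all $z_{1,\ell}\in\mathcal R^\ell$ and $z\in\mathcal R$.

Proof of Lemma C.3. Starting with $F$, denote its path integral by $f$ as above.
Define

\[
g_n(\omega,z_{1,\ell})=-|\mathcal R|^{-\ell}(2n+1)^{-d}\sum_{\bar z_{1,\ell}\in\mathcal R^\ell}\ \sum_{|x|\le n}f(\omega,z_{1,\ell},\bar z_{1,\ell},x).
\]

By part (b) of Lemma C.1

\[
\begin{aligned}
f(\omega,z_{1,\ell},\bar z_{1,\ell},x)+f(T_x\omega,\bar z_{1,\ell},\bar z_{1,\ell},z_1)&=f(\omega,z_{1,\ell},\bar z_{1,\ell},x+z_1)\\
&=f(\omega,z_{1,\ell},S^+_zz_{1,\ell},z_1)+f(T_{z_1}\omega,S^+_zz_{1,\ell},\bar z_{1,\ell},x).
\end{aligned}
\]

Consequently, from the closed loop property alone,

\[
\begin{aligned}
g_n(S^+_z\eta)-g_n(\eta)
&=|\mathcal R|^{-\ell}(2n+1)^{-d}\sum_{\bar z_{1,\ell}\in\mathcal R^\ell}\ \sum_{|x|\le n}\bigl[f(\omega,z_{1,\ell},\bar z_{1,\ell},x)-f(T_{z_1}\omega,S^+_zz_{1,\ell},\bar z_{1,\ell},x)\bigr]\\
&=|\mathcal R|^{-\ell}(2n+1)^{-d}\sum_{\bar z_{1,\ell}\in\mathcal R^\ell}\ \sum_{|x|\le n}\bigl[f(\omega,z_{1,\ell},S^+_zz_{1,\ell},z_1)-f(T_x\omega,\bar z_{1,\ell},\bar z_{1,\ell},z_1)\bigr]\qquad\text{(C.4)}\\
&=F(\eta,z)-|\mathcal R|^{-\ell}(2n+1)^{-d}\sum_{\bar z_{1,\ell}\in\mathcal R^\ell}\ \sum_{|x|\le n}f(T_x\omega,\bar z_{1,\ell},\bar z_{1,\ell},z_1).\qquad\text{(C.5)}
\end{aligned}
\]

By parts (a) and (c) of Lemma C.1 and by the ergodic theorem we see that
$F(\eta,z)$ is the $L^1(\mathbb P)$-limit of $g_n(S^+_z\eta)-g_n(\eta)$ for each $z_{1,\ell}\in\mathcal R^\ell$ and
$z\in\mathcal R$. Finally, approximate the integrable $g_n$ with a bounded $h_n$ in
$L^1(\mathbb P)$. □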

Proof of Lemma 2.9. The L ! (P) convergence to follows from Lemma C.3. Next, 
observe that for any a\ M that satisfies the properties in braces in the statement of 
the lemma, the F-sum satisfies 

£ F(i]i,ai + i) = f((Q,zi,e,zi,e,x„(£)). 

!=0 

Consequently the task is to show that n~ l f(co,z\/,Zi/,x n {%)) has a limit P-a.s. 

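Recall that the definition of the path $\hat x_\cdot(\xi)$ given above Lemma 2.8 involved
an integer $b = b(\xi)$ such that $b\xi \in \mathbb{Z}^d$ and $\hat x_{mb}(\xi) = mb\xi$ for all $m$. Using (b) of
Lemma C.1 we have

\[
(mb)^{-1} f(\omega,z_{1,\ell},\tilde z_{1,\ell},mb\xi) = (mb)^{-1} f(\omega,z_{1,\ell},\tilde z_{1,\ell},0)
  + (mb)^{-1} \sum_{j=0}^{m-1} f(T_{jb\xi}\omega,\tilde z_{1,\ell},\tilde z_{1,\ell},b\xi)
\]

and by the ergodic theorem, the right-hand side has a $\mathbb{P}$-almost sure limit.

Given $n$ choose $m_n$ so that $m_n b \le n < (m_n+1)b$. By (b) and (a) of Lemma C.1

\begin{align*}
n^{-1} \bigl| f(\omega,z_{1,\ell},\tilde z_{1,\ell},\hat x_n(\xi)) - f(\omega,z_{1,\ell},\tilde z_{1,\ell},m_n b\xi) \bigr|
  &= n^{-1} \bigl| f(T_{m_n b\xi}\omega,\tilde z_{1,\ell},\tilde z_{1,\ell},\hat x_n(\xi) - m_n b\xi) \bigr| \\
  &\le (m_n b)^{-1} G(T_{m_n b\xi}\omega) \to 0 \quad \mathbb{P}\text{-a.s.},
\end{align*}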

where G(to) = Lc|*|<c(*|S |+i) max. e ^ |F(r x G),zi^,z)| is mL l (P). □ 
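Proof of Lemma 2.13. Fix $\varepsilon > 0$ for the rest of the proof. From (2.13) we have that
for $\mathbb{P}$-a.e. $\omega$ and for all $z_{1,\ell} \in \mathcal{R}^\ell$ and $z \in \mathcal{R}$

\[
F^{(0)}_{k,\varepsilon}(\eta,z) \le C - \mathbb{E}[g(\eta) \,|\, \mathfrak{S}_k].
\]

By the fact that $g(\omega,z_{1,\ell}) \in L^1(\mathbb{P})$ for all $z_{1,\ell} \in \mathcal{R}^\ell$ the right-hand side is uniformly
integrable. Thus so is $F^{(0),+}_{k,\varepsilon} = \max(F^{(0)}_{k,\varepsilon},0)$.

Let $F^{(0),-}_{k,\varepsilon} = \max(-F^{(0)}_{k,\varepsilon},0)$. Observe that by the $T_z$-invariance of $\mathbb{P}$

\begin{align*}
\mathbb{E}[F^{(0)}_{k,\varepsilon}(\omega,z_{1,\ell},z)]
  &= \mathbb{E}[h_{k,\varepsilon}(T_{z_1}\omega,S^+_z z_{1,\ell}) - h_{k,\varepsilon}(\omega,z_{1,\ell})] \\
  &= \mathbb{E}[h_{k,\varepsilon}(\omega,S^+_z z_{1,\ell})] - \mathbb{E}[h_{k,\varepsilon}(\omega,z_{1,\ell})].
\end{align*}

Thus $F^{(0)}_{k,\varepsilon}$ satisfies the mean-zero property (ii) in Definition 2.2. Letting
$\eta_0 = (\omega,z_{1,\ell})$, $z_0 = z$, $a_i = z_{i-1}$, and $\eta_i = S^+_{a_i}\eta_{i-1}$ for $i = 1,\dots,\ell+1$, one has that

\[
\mathbb{E}[F^{(0),-}_{k,\varepsilon}(\eta,z)] \le \sum_{i=0}^{\ell} \mathbb{E}[F^{(0),-}_{k,\varepsilon}(\eta_i,a_{i+1})]
  = \sum_{i=0}^{\ell} \mathbb{E}[F^{(0),+}_{k,\varepsilon}(\eta_i,a_{i+1})]
\]

is bounded uniformly in $k$. We apply the following lemma to extract a uniformly
integrable part leaving a small error.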

Lemma C.4 (Lemma 4.3 of [28]). Let $\{g_n\}_{n\ge 1}$ be a sequence of nonnegative functions
such that $\sup_n \mathbb{E}[g_n] \le C$. Then there is a subsequence $\{n_j\}_{j\ge 1}$ and an
increasing sequence $a_j \nearrow \infty$ such that $g_{n_j}\mathbf{1}\{g_{n_j} \le a_j\}$ is uniformly integrable and
$g_{n_j}\mathbf{1}\{g_{n_j} > a_j\}$ converges to 0 in probability.
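
As a gloss on Lemma C.4 (our own remark, not quoted from [28]): the convergence in probability of the large part is the easy half and follows from Markov's inequality for any $a_j \to \infty$; the substance of the lemma is that the subsequence and truncation levels can be chosen so that the truncated part is uniformly integrable. The example at the end, on $[0,1]$ with Lebesgue measure, is an illustrative assumption.

```latex
% Easy half: for any \delta > 0 and a_j > 0,
\[
  \mathbb{P}\bigl( g_{n_j}\mathbf{1}\{g_{n_j} > a_j\} > \delta \bigr)
  \le \mathbb{P}( g_{n_j} > a_j )
  \le \frac{\mathbb{E}[g_{n_j}]}{a_j}
  \le \frac{C}{a_j} \longrightarrow 0
  \qquad (j \to \infty).
\]
% Illustration: on [0,1] with Lebesgue measure let g_n = n \mathbf{1}_{[0,1/n]}.
% Then E[g_n] = 1 for all n, but \{g_n\} is not uniformly integrable.
% With n_j = j and a_j = \sqrt{j} (j > 1):
\[
  g_j \mathbf{1}\{g_j \le \sqrt{j}\} = 0,
  \qquad
  g_j \mathbf{1}\{g_j > \sqrt{j}\} = g_j,
  \qquad
  \mathbb{P}(g_j \ne 0) = \tfrac{1}{j} \to 0 .
\]
```

In this example the whole mass sits in the error term, which still tends to 0 in probability even though its expectation stays equal to 1.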

By the above lemma we can write $F^{(0),-}_{k,\varepsilon} = \tilde F^{(0),-}_{k,\varepsilon} + R^{(0)}_{k,\varepsilon}$ such that along a
subsequence $\tilde F^{(0),-}_{k,\varepsilon}$ is uniformly integrable and $R^{(0)}_{k,\varepsilon} \ge 0$ is $\mathfrak{S}_k$-measurable and
converges to 0 in $\mathbb{P}$-probability. One can then take a further subsequence along which
$\hat F^{(0)}_{k,\varepsilon} = F^{(0),+}_{k,\varepsilon} - \tilde F^{(0),-}_{k,\varepsilon}$ converges in weak $L^1(\mathbb{P})$ to $\hat F^{(0)}_\varepsilon$ and $R^{(0)}_{k,\varepsilon} \to 0$ $\mathbb{P}$-a.s. (Uniform
integrability gives sequential compactness in weak $L^1$; see Theorem 9 on page 292
of [16].) We will always keep indexing subsequences by $k$. Now we have the
decomposition

\[
F^{(0)}_{k,\varepsilon} = \hat F^{(0)}_{k,\varepsilon} - R^{(0)}_{k,\varepsilon}. \tag{C.6}
\]

An attempt to check that the limit $\hat F^{(0)}_\varepsilon$ satisfies the closed loop property runs
into difficulty because we have very weak control of the errors $R^{(0)}_{k,\varepsilon}$ and the
conditioning in definition (2.14) damages the closed loop property of the function
$h_{k,\varepsilon}(S^+_z\eta) - h_{k,\varepsilon}(\eta)$. To get around this we defined the family indexed by $i$ in
(2.14). In the next lemma we develop a hierarchy of errors obtained by successive
application of Lemma C.4. We give the proof after the current proof is done.
Recall that given $z_{1,j} \in \mathcal{R}^j$, $x_j = z_1 + \cdots + z_j$. We will use the notation 0 for a path
of length $j = 0$ and then $x_0 = 0$.
Lemma C.5. There exist nonnegative random variables on $\Omega_\ell \times \mathcal{R}$, denoted by

B%i* lJ \ ^e Zl ' J+l \ andR^ ,Zlj) , with < j < i < k and z\,j+\ G & J+l , such that 
the following properties are satisfied. 

(a) 

(b) R%'e Zlj) > ^k'e ZlJ+l) ' andR ke' Zlj) are T Xj & k -i-measurable. 

(c) E[R^ | 6 t _f-i] = ^: 1A0) +4J 1A0) /^ ofl i > 0. 

(d) E[4;f ^ ir^et-w] aV; l/ ! " • <; L/ M ./'"• aai>j>\ 

(e) e[r^ j) | r^et-j-i] = ^ u+MlJ+0 +4'J 1J+1 ' zlJ+l) /^«// i > j > o 

andzij+i €& +l . 

(f) As — >■ °°, rM' Z and R^'g' Zl ' are uniformly integrable and converge 
in weak L l (¥) to a limit Rg' Zl andR^'^' Zl ' , respectively. 

(g) /?£'g' Z converges to P-#.j. aw & — >■ °°. 

(h) One «£w/or ; > 0, zij+i G ands>\ 

_ jjfa.Ui) _| |_^(;'+*j'+i,zij+i) 

(i) For any fixed j >0 and z\,j + \ bothR ( ^ hzl ' ]) andR^ ,hZi ' 1+l) converge 
to strongly in L l {¥) asi^oo. 

The limits as $k \to \infty$ are to be understood in the sense that there exists one
subsequence along which all the countably many claimed limits hold simultaneously.

Fix i > and let k > i. Starting with (C.6), using (a) of the above lemma, and 
applying (c) repeatedly we have the decomposition F, ^ = F^' — R^ with 

^=E[^-^-...-S^\6^ t ] and 

R^ e is ©^-measurable and F^ £ uniformly integrable. (The proof of Theorem 5.1 
in Chapter 4 of [17] applies to the uniformly integrable sequence F, ^ — R^' ^ — 
•..-^f 0) .) One can check by a standard %-X or monotone class argument 
that any weak L 1 (P) limit coincides with the weak limit without the condition- 
ing, namely F e = Fg —Ry ' ' R^' O,0 \ Furthermore, since Ef/?^] is uni- 
formly bounded in k we have 

E[i^> + ... + ^]<E[Rg]<C. 
Taking $i \to \infty$ we see that $\hat F^{(i)}_\varepsilon$ decreases, converging strongly in $L^1(\mathbb{P})$ to

F E =F^-^^. 

Then, $F_\varepsilon \in L^1(\mathbb{P})$ satisfies (i) of Definition 2.2.

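Fix a path $x_{0,\infty}$ with increments in $\mathcal{R}$. Fix integers $k \ge b \ge j \ge 0$. Recall that
$T_{\pm z}\mathfrak{S}_{s-1} \subset \mathfrak{S}_s$ for all $z \in \mathcal{R}$ and $s \ge 1$. In particular, $\mathfrak{S}_{k-b+j} \supset T_{x_1}\mathfrak{S}_{k-b+j-1} \supset
\cdots \supset T_{x_j}\mathfrak{S}_{k-b}$. Applying (e) and (b) of Lemma C.5 repeatedly one has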



(C.7) 

Thus, 
(C.8) 



(C.9) 



(CIO) 



E 



R 



(b-j) 
k,e 



T X] 6 k - b 



E 



I* 

s=l 



(b-j+s,s,zi, s ) 

k,£ 



T X] &u- b 



+ R 



k,e 



E 



hk,e(T x . +1 <D,Zj+2J+£+l) ~ h,e(T Xj (0,Zj + \J+e) 



E 



E[h k , e (T Xj+l (Q,Zj + 2,j+e+l) 

-h kfi (T Xj (0,Zj+\,j+i) I T- Xj & k -b+j] 
E F t ( * _y) (2i.ffl,z_/ + i J+ f,Zj + ^ + i) & k -b 

E Ft" ^(r r ..a>,Z;+i.;+/>,Z;+/'+l) ©£_/, 



&k-t 



■ E 



©Jfc- 



E 



■ E 



R 



(b-j+l,l,zi) 

k,e 



+ • ■ ■+Rff 1 ' i) (-,z j+ ij +e ,z j+ i +l ) | r t; 6,_J (7i.fi)) 



k.e 



(T Xj (o,Zj+ij+e,Zj + e+\). 



The last equality used (C.7) and the formula $\mathbb{E}[g \,|\, \mathfrak{S}_k] \circ T_x = \mathbb{E}[g \circ T_x \,|\, T_{-x}\mathfrak{S}_k]$. The
two sequences in (C.9) and (C.10) are uniformly integrable and converge weakly
in L ! (P) (along a subsequence) to F^ ] ' (T x .(0,Zj+ij+e,Zj+e+i) and 

[ K e H J(,Ar ; ft>,Zy + ij + f,Z ; - + f + iJ, 

respectively. 
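
The commutation formula between conditional expectation and a measure-preserving shift, used above, can be checked by hand on a finite probability space. The following is a minimal sketch under stated assumptions: a uniform measure on six points, a cyclic shift `T`, and a σ-algebra given by a partition into atoms; none of these objects come from the paper. The pulled-back σ-algebra is written here as $T^{-1}\mathcal{A} = \{T^{-1}(B) : B \in \mathcal{A}\}$; how this matches the paper's notation for shifted σ-algebras depends on its convention, which is not restated in this section.

```python
from fractions import Fraction

# Toy setting (assumption): uniform probability on {0,...,5}.
OMEGA = range(6)

def T(w):
    # Cyclic shift; a bijection, hence measure preserving for the uniform law.
    return (w + 1) % 6

# Atoms generating a sigma-algebra A on OMEGA.
partition = [{0, 1, 2}, {3, 4}, {5}]

def cond_exp(g, atoms):
    # E[g | sigma(atoms)] under the uniform measure, as a dict omega -> value.
    out = {}
    for atom in atoms:
        avg = Fraction(sum(g(w) for w in atom), len(atom))
        for w in atom:
            out[w] = avg
    return out

def preimage_atoms(atoms):
    # Atoms of the pulled-back sigma-algebra {T^{-1}(B) : B in A}.
    return [{w for w in OMEGA if T(w) in atom} for atom in atoms]

def g(w):
    # Arbitrary test function.
    return w * w

e_g = cond_exp(g, partition)
lhs = {w: e_g[T(w)] for w in OMEGA}                           # E[g | A] o T
rhs = cond_exp(lambda w: g(T(w)), preimage_atoms(partition))  # E[g o T | T^{-1} A]
```

The two dictionaries `lhs` and `rhs` coincide pointwise, which is the finite version of the identity; exact rational arithmetic avoids any rounding issue in the comparison.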

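For any two paths $\{\eta_i\}_{i=0}^{n}$ and $\{\bar\eta_j\}_{j=0}^{m}$ as in (iii) of Definition 2.2

\[
\sum_{i=0}^{n-1} \mathbb{E}\bigl[ h_{k,\varepsilon}(\eta_{i+1}) - h_{k,\varepsilon}(\eta_i) \,\big|\, \mathfrak{S}_{k-b} \bigr]
  = \sum_{j=0}^{m-1} \mathbb{E}\bigl[ h_{k,\varepsilon}(\bar\eta_{j+1}) - h_{k,\varepsilon}(\bar\eta_j) \,\big|\, \mathfrak{S}_{k-b} \bigr].
\]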
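
The equality of the two conditional sums is a telescoping identity. A short worked step, added here as a gloss and assuming (as the name "closed loop" suggests) that the two paths in (iii) of Definition 2.2 share their initial and final points, $\eta_0 = \bar\eta_0$ and $\eta_n = \bar\eta_m$:

```latex
\[
  \sum_{i=0}^{n-1} \bigl( h_{k,\varepsilon}(\eta_{i+1}) - h_{k,\varepsilon}(\eta_i) \bigr)
  = h_{k,\varepsilon}(\eta_n) - h_{k,\varepsilon}(\eta_0)
  = h_{k,\varepsilon}(\bar\eta_m) - h_{k,\varepsilon}(\bar\eta_0)
  = \sum_{j=0}^{m-1} \bigl( h_{k,\varepsilon}(\bar\eta_{j+1}) - h_{k,\varepsilon}(\bar\eta_j) \bigr).
\]
% Taking conditional expectations of both sides with respect to the same
% sigma-algebra and using linearity gives the equality of the conditioned sums.
```

The difficulty addressed in the rest of the proof is that each summand is conditioned on a different σ-algebra once it is rewritten in terms of the functions $F^{(i)}_{k,\varepsilon}$, so the telescoping survives only up to the error terms controlled by Lemma C.5.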
Now, taking $b > \max(m,n)$ and further subsequences of (C.8) we arrive at

i=0 5=1 

7=0 i=l 

Here, $z_{1,n}$ and $\bar z_{1,m}$ denote the steps of the two paths corresponding to $\{\eta_i\}$ and
$\{\bar\eta_j\}$. Taking $b \to \infty$ and applying part (i) of Lemma C.5 we conclude that $F_\varepsilon$
satisfies the closed loop property (iii) of Definition 2.2. Next, we work on the
mean zero property.

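Abbreviate $\bar z_{1,\ell} = (z,\dots,z) \in \mathcal{R}^\ell$. Then,

\begin{align*}
c(z) &= \mathbb{E}[F_\varepsilon(\omega,\bar z_{1,\ell},z)] = \inf_i \mathbb{E}[\hat F^{(i)}_\varepsilon(\omega,\bar z_{1,\ell},z)] = \inf_i \lim_{k\to\infty} \mathbb{E}[\hat F^{(i)}_{k,\varepsilon}(\omega,\bar z_{1,\ell},z)] \\
     &\ge \inf_i \lim_{k\to\infty} \mathbb{E}[F^{(i)}_{k,\varepsilon}(\omega,\bar z_{1,\ell},z)] = \lim_{k\to\infty} \mathbb{E}[h_{k,\varepsilon}(T_z\omega,\bar z_{1,\ell}) - h_{k,\varepsilon}(\omega,\bar z_{1,\ell})] = 0.
\end{align*}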

Since $F_\varepsilon$ satisfies the closed loop property, one can define its path integral $f_\varepsilon$ as
above and use (C.3) to write

(C.ll) 

fe((0,Zi/.,Zi,e,z) = F e (T^ £z (0,Zl,£,z) 

+ F e (T_^_ l - )z (o,zi,e,zi) H <tF e ((o,(z,i,zi/-i),ze) 

- F e (T_ iz co,zi,e,zi) F e (T_ z <o,(z£,zi,£-i),ze). 

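Thus, we have $c(z) = \mathbb{E}[f_\varepsilon(\omega,z_{1,\ell},z_{1,\ell},z)]$ for all $z_{1,\ell} \in \mathcal{R}^\ell$ and $z \in \mathcal{R}$. Hence

\[
c(z) = |\mathcal{R}|^{-\ell} \sum_{z_{1,\ell}\in\mathcal{R}^\ell} \mathbb{E}[f_\varepsilon(\omega,z_{1,\ell},z_{1,\ell},z)].
\]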

Integrating (C.5) out (with $F = F_\varepsilon$) one sees that

\[
\mathbb{E}[g_n(S^+_z\eta) - g_n(\eta)] = \mathbb{E}[F_\varepsilon(\eta,z) - c(z_1)].
\]

Since $g_n(S^+_z\eta) - g_n(\eta)$ has the mean zero property (ii) of Definition 2.2, we
conclude that $F_\varepsilon(\eta,z) - c(z_1)$ does too. Let $z_{1,\ell} \in \mathcal{R}^\ell$, $\tilde z_{1,n} \in \mathcal{R}^n$, $x = \tilde z_1 + \cdots + \tilde z_n$,
and apply the mean zero property of $F_\varepsilon(\eta,z) - c(z_1)$ to the path that takes steps
$(\tilde z_{1,n},z_{1,\ell})$ to go from $(0,z_{1,\ell})$ to $(x+x_\ell,z_{1,\ell})$. This gives

\[
\mathbb{E}[f_\varepsilon(\omega,z_{1,\ell},z_{1,\ell},x+x_\ell)] = c(z_1) + \cdots + c(z_\ell) + \sum_{i=1}^{n} c(\tilde z_i). \tag{C.12}
\]

Since the left-hand side does not depend on $\tilde z_{1,n}$ as long as the increments add up
to $x$, we see that $c(\tilde z_1) + \cdots + c(\tilde z_n)$ only depends on $\tilde z_1 + \cdots + \tilde z_n$. Consequently,
$F_\varepsilon(\eta,z) - c(z_1)$ also has the closed loop property and thus belongs to $\mathcal{K}_0$. This
completes the proof of Lemma 2.13. □
Proof of Lemma C.5. In what follows, decomposing a sequence $R_k \ge 0$ means
applying Lemma C.4 to it. The leftmost term in the decomposition is the one that
converges in weak $L^1(\mathbb{P})$ topology along a subsequence. Its limit is denoted by
the same symbol, with $k$ omitted. The rightmost term in the decomposition is the
one converging to 0 $\mathbb{P}$-a.s. Subsequences are chosen to work for all $z_{1,j}$, $j \ge 1$,
at once, and are still indexed by $k$. Once a subsequence has been given to suit a
decomposition, subsequent decompositions go along this subsequence, and so on.
Induction will be repeatedly used in our proof and once an induction is complete,
the diagonal trick is used to obtain one subsequence that works for all the terms
simultaneously. Recall that $R^{(0)}_{k,\varepsilon} \ge 0$ and $\mathbb{E}[R^{(0)}_{k,\varepsilon}]$ is bounded uniformly in $k$.

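The following diagram may be instructive to the reader during the course of the
proof. Index the columns from left to right by $i = 0,1,\dots,k$ and the rows from top
to bottom by $j = 0,1,\dots,k$.

\[
\begin{array}{ccccccc}
\mathfrak{S}_k & \mathfrak{S}_{k-1} & \mathfrak{S}_{k-2} & \cdots & \mathfrak{S}_{k-i} & \cdots & \mathfrak{S}_0 \\
 & T_{x_1}\mathfrak{S}_{k-1} & T_{x_1}\mathfrak{S}_{k-2} & \cdots & T_{x_1}\mathfrak{S}_{k-i} & \cdots & T_{x_1}\mathfrak{S}_0 \\
 & & T_{x_2}\mathfrak{S}_{k-2} & \cdots & & & \vdots \\
 & & & \ddots & T_{x_j}\mathfrak{S}_{k-i} & \cdots & \vdots \\
 & & & & & \ddots & T_{x_k}\mathfrak{S}_0
\end{array}
\]

The algebra on row $j$ and column $i \ge j$ is $T_{x_j}\mathfrak{S}_{k-i}$. Each algebra on the diagram
includes the one down and right of it, the one up and right of it, and the one to
the right of it. The decomposition in (d) corresponds to a step up and right in the
diagram, while the decomposition in (e) corresponds to a step down and right.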

We will define ^^ J+1 \ and by induction on s = i- j > 

0. On the above diagram, this corresponds to the $s$-th diagonal starting at $\mathfrak{S}_{k-s}$
and going down to $T_{x_1}\mathfrak{S}_{k-s-1}$ and so on. We check property (i) after the whole
induction process is complete.

Induction assumption for s: there exist nonnegative random variables on £2^ x 
SI, denoted by R^ Z[J \ ^J' Z[ ' J+[) , and R^ 1J \ with < j <i <k, i- j <s, and 
Zij+i € M 3+ , such that properties (a-h) are satisfied (whenever the terms involved 
have already been defined). 

Set Rff' 0) = R { ® and Rff' 0) = 0. For k > j > 0, zi j+i € observe that 
T x . +1 Gk-j-i C T x .<Bk-j and decompose inductively 



6* ©*_i 6 k -2 
T Xl &k-i T Xl &t-2 

7*2 &k-2 
For j > and zij+i G set RvJ&j+i) = 0. These are actually never used in
properties (a-i) of the lemma. This settles the case s = 0. 
Next, for k > 0, decompose 

and for k > j > 1 decompose inductively 

Set £^ e +1 = o for all k > j>0 and € These are again not used 

in properties (a-i) of the lemma. This settles the case $s = 1$.

Now fix s > 1 and assume the induction assumption for this s. We will define 
^s+iJw) t $U+*+iJ*iJ + i) t and R U+*iJ*j) by induction on j > o. on the above 
diagram, this corresponds to going along the fixed $s$-th diagonal.

Induction assumption for j with s > 1 fixed: we have defined /Jr^ s+1 '''' Zl , 

Rj/^ s+1 'j' Zl ' J+1 , and J '' ZI J \ such that properties (a-h) are satisfied (whenever 

the terms involved have already been defined). 

Observe that &k- s -\ C T Zl &k- s and temporarily decompose 

E[< e °' 0) | 6 fc _ s _!] = /C' " " • AV, '- ,, : ' and 

E[R^|6^ 1 ]=^ 1 '^ ) +^ 1Ao) . 

Let 7? be the smallest of R { ^ ifi ' z>) , z\ G and R^ 1,o,0 \ Use (c) and (e) with 
i = s — 1 and j = to write 

E[% o ' )|e^_ 1 ]+^ e +1 '°' 0) +< e +1 ^ 0) 

= E[«£- 1A0 5 1 6^1] = E[R^ | +% +1 '°' Zl) . 

The above display shows that the differences ^+ 1,0 ' Z| ) _ r an d /^( J + 1 ' ' ) _ are 
uniformly integrable. Redefine all the terms R^t 1 ' , Z\ G ffl, and /?( A + 1,0 ' ) to 
equal 7? and redefine ^ e +1 '°' 0) to equal /^ r 1 "' ' +R^ m -R and S^ 1 ' ^ to 
equal 1 ,0 ' Zl ) + gjg 1 fi ' zi ) - /?. The upshot is that one can assume that 1 Azi ) = 

R (s+ 1,0,0) for aU ^ G ^ Taking k 

— >■ oo in the above display verifies (h) for j = 0. 

This starts the induction at j = 0. 

Now we go from j to j+ 1. Temporarily decompose 

E ^u+*+u*u) | r Vj+l6 ,_ ; ._,_ 2 ] = £0^+2j+i w ) +/? a+s + 2j + i, zlJ+l) and 

(C.13) 

Tr7\nU+ s + l ,j+ 2 >Zl,j+2) \rp ff, I nO'+S+2,./+l,Zl j+l) , p(y'+i+2J+l,Zij +2 ) 
Then, one has



E[R 




k,e \ 1x h 
(j+s+2,j+l,zij+i) 




V \T Xj+l 6k-j. 



j-s-2. 




(j+s+2,j+l,zi tj+2 ) 



Here is an explanation: use (c) first twice to condition R( s l >°> ) on &k- s and then 
on Gk-s-i- Next, use (e) conditioning r( s+1 >°> ) on T Xl <B k - s - 2 then /?( i + 2 . 1 ->) on 
T X2 &k-s-3 and so on until conditioning R\ s+ j +1 ^^J> on T x . +l &k- s -j-2- On the 
other side use (e) conditioning 7?( s_10 ' ) on T Xl &k- s then R^' 1 '* 1 ' conditioned on 
T X2 <5k- s -i and so on until R( s+ j'j +1 ' Zl J+^ is conditioned on T Xj+2 &k- s -j-i- Then 
use (C.13) and condition R(s+j+hj+2,z h j +2 ) on T Xj+i e k - s -j- 2 . 

Now, repeating what we have done for the case j = 0, we can assume that 



n (J+ s+ 2, j+ l w ) =R U+ s+ 2J+l, zlJ+l ) for ^ z ^_ +2 e ^ ; - +2 _ Tak . ng ^ ^ ^ ver . fies 
(h). 



We have achieved the induction on j and thus also the induction on s. Our 
construction is thus complete once we prove it satisfies (i). Using (h), this follows 
easily by induction on j > once one shows that R^ 0,0 — > strongly in L ! (P), 
which itself follows from the fact that E[r[ 1 £ °' 0) + ■■■ +^f £ °' 0) ] < E[rJJ] is uni- 
formly bounded in k. The lemma is proved. □ 